METHODS AND COMPOSITIONS FOR INHIBITION OF MEMBRANE
FUSION-ASSOCIATED EVENTS, INCLUDING HIV TRANSMISSION

1. INTRODUCTION 
The present invention relates, first, to DP178
(SEQ ID NO:1), a peptide, also referred to herein as
T20, corresponding to amino acids 638 to 673 of the
HIV-1 LAI transmembrane protein (TM) gp41, and portions
or analogs of DP178 (SEQ ID NO:1), which exhibit
anti-membrane fusion capability, antiviral activity, such
as the ability to inhibit HIV transmission to
uninfected CD-4+ cells, or an ability to modulate
intracellular processes involving coiled-coil peptide
structures. The present invention also relates to
peptides analogous to DP107 (SEQ ID NO:25), a peptide,
which is also referred to herein as T21, corresponding
to amino acids 558 to 595 of the HIV-1 LAI transmembrane
protein (TM) gp41, having amino acid sequences present
in other viruses, such as enveloped viruses, and/or
other organisms, and further relates to the uses of
such peptides. These peptides exhibit anti-membrane
fusion capability, antiviral activity, or the ability
to modulate intracellular processes involving
coiled-coil peptide structures.

The gp41 region from which DP107 is derived is
referred to herein as HR1. The gp41 region from which
DP178 is derived is referred to herein as HR2. As
discussed herein, the gp41 HR1 and HR2 regions
interact (non-covalently) with each other and/or with
T20 and T21 peptides. This interaction is required
for normal infectivity of HIV.

The present invention therefore additionally
relates to methods for identifying compounds,
including small molecule compounds, that disrupt the
interaction between DP178 and DP107, and/or between
DP107-like and DP178-like peptides. In one
embodiment, such methods relate to identification and
utilization of modified DP178, DP178-like, DP107 and
DP107-like peptides and peptide pairs that interact
with each other at a lower affinity than the affinity
exhibited by corresponding "parent" or "native"
peptides. Further, the invention relates to the use
of DP178, DP178 portions, DP107, DP107 portions and/or
analogs and other modulators, including small
molecule modulators, of DP178/DP107,
DP178-like/DP107-like, or HR1/HR2 interactions as
antifusogenic or antiviral compounds or as inhibitors
of intracellular events involving coiled-coil peptide
structures. The invention is demonstrated, first, by
way of an Example wherein DP178 (SEQ ID:1), and a
peptide whose sequence is homologous to DP178 are each
shown to be potent, non-cytotoxic inhibitors of HIV-1
transfer to uninfected CD-4+ cells. The invention is
further demonstrated by Examples wherein peptides
having structural and/or amino acid motif similarity
to DP107 and DP178 are identified in a variety of
viral and nonviral organisms, and in Examples wherein
a number of such identified peptides derived from
several different viral systems are demonstrated to
exhibit antiviral activity. The invention is still
further demonstrated by way of Examples wherein other
DP178-like and DP107-like peptides are identified that
interact with their corresponding HR1 and HR2 domains
with a lower affinity than the affinity exhibited by
the native DP178 or DP107 peptide from which they are
derived.

2. BACKGROUND OF THE INVENTION

2.1. MEMBRANE FUSION EVENTS
Membrane fusion is a ubiquitous cell biological
process (for a review, see White, J.M., 1992, Science
258:917-924). Fusion events which mediate cellular
housekeeping functions, such as endocytosis,
constitutive secretion, and recycling of membrane
components, occur continuously in all eukaryotic
cells.

Additional fusion events occur in specialized 
cells. Intracellularly, for example, fusion events 
are involved in such processes as occur in regulated 
exocytosis of hormones, enzymes and neurotransmitters. 
Intercellularly, such fusion events feature 
prominently in, for example, sperm-egg fusion and 
myoblast fusion. 

Fusion events are also associated with disease
states. For example, fusion events are involved in
the formation of giant cells during inflammatory
reactions, the entry of all enveloped viruses into
cells, and, in the case of human immunodeficiency
virus (HIV), for example, are responsible for the
virally induced cell-cell fusion which leads to cell
death.

2.2. THE HUMAN IMMUNODEFICIENCY VIRUS 
The human immunodeficiency virus (HIV) has been
implicated as the primary cause of the slowly
degenerative immune system disease termed acquired
immune deficiency syndrome (AIDS) (Barre-Sinoussi, F.
et al., 1983, Science 220:868-870; Gallo, R. et al.,
1984, Science 224:500-503). There are at least two
distinct types of HIV: HIV-1 (Barre-Sinoussi, F. et
al., 1983, Science 220:868-870; Gallo, R. et al., 1984,
Science 224:500-503) and HIV-2 (Clavel, F. et al.,
1986, Science 233:343-346; Guyader, M. et al., 1987,
Nature 326:662-669). Further, a large amount of
genetic heterogeneity exists within populations of
each of these types. Infection of human CD-4+
T-lymphocytes with an HIV virus leads to depletion of
the cell type and eventually to opportunistic
infections, neurological dysfunctions, neoplastic
growth, and ultimately death.

HIV is a member of the lentivirus family of
retroviruses (Teich, N. et al., 1984, RNA Tumor
Viruses, Weiss, R. et al., eds., CSH Press, pp. 949-
956). Retroviruses are small enveloped viruses that
contain a diploid, single-stranded RNA genome, and
replicate via a DNA intermediate produced by a
virally-encoded reverse transcriptase, an
RNA-dependent DNA polymerase (Varmus, H., 1988, Science
240:1427-1439). Other retroviruses include, for
example, oncogenic viruses such as human T-cell
leukemia viruses (HTLV-I, -II, -III), and feline
leukemia virus.

The HIV viral particle consists of a viral core,
composed of capsid proteins, that contains the viral
RNA genome and those enzymes required for early
replicative events. Myristylated Gag protein forms an
outer viral shell around the viral core, which is, in
turn, surrounded by a lipid membrane envelope derived
from the infected cell membrane. The HIV envelope
surface glycoproteins are synthesized as a single 160
Kd precursor protein which is cleaved by a cellular
protease during viral budding into two glycoproteins,
gp41 and gp120. gp41 is a transmembrane protein and
gp120 is an extracellular protein which remains
non-covalently associated with gp41, possibly in a
trimeric or multimeric form (Hammarskjold, M. and
Rekosh, D., 1989, Biochem. Biophys. Acta 989:269-280).

HIV is targeted to CD-4+ cells because the CD-4
cell surface protein acts as the cellular receptor for
the HIV-1 virus (Dalgleish, A. et al., 1984, Nature
312:763-767; Klatzmann et al., 1984, Nature 312:767-
768; Maddon et al., 1986, Cell 47:333-348). Viral
entry into cells is dependent upon gp120 binding the
cellular CD-4+ receptor molecules (McDougal, J.S. et
al., 1986, Science 231:382-385; Maddon, P.J. et al.,
1986, Cell 47:333-348) and thus explains HIV's tropism
for CD-4+ cells, while gp41 anchors the envelope
glycoprotein complex in the viral membrane.



2.3. HIV TREATMENT 
HIV infection is pandemic and HIV-associated
diseases represent a major world health problem.
Although considerable effort is being put into the
successful design of effective therapeutics, currently
no curative anti-retroviral drugs against AIDS exist.
In attempts to develop such drugs, several stages of
the HIV life cycle have been considered as targets for
therapeutic intervention (Mitsuya, H. et al., 1991,
FASEB J. 5:2369-2381). For example, virally encoded
reverse transcriptase has been one focus of drug
development. A number of
reverse-transcriptase-targeted drugs, including
2',3'-dideoxynucleoside analogs such as AZT, ddI, ddC,
and d4T, have been developed which have been shown to
be active against HIV (Mitsuya, H. et al., 1991,
Science 249:1533-1544). While beneficial, these
nucleoside analogs are not curative, probably due to
the rapid appearance of drug resistant HIV mutants
(Lander, B. et al., 1989, Science 243:1731-1734). In
addition, the drugs often exhibit toxic side effects
such as bone marrow suppression, vomiting, and liver
function abnormalities.

Attempts are also being made to develop drugs
which can inhibit viral entry into the cell, the
earliest stage of HIV infection. Here, the focus has
thus far been on CD4, the cell surface receptor for
HIV. Recombinant soluble CD4, for example, has been
shown to inhibit infection of CD-4+ T-cells by some
HIV-1 strains (Smith, D.H. et al., 1987, Science
238:1704-1707). Certain primary HIV-1 isolates,
however, are relatively less sensitive to inhibition
by recombinant CD-4 (Daar, E. et al., 1990, Proc.
Natl. Acad. Sci. USA 87:6574-6579). In addition,
recombinant soluble CD-4 clinical trials have produced
inconclusive results (Schooley, R. et al., 1990, Ann.
Int. Med. 112:247-253; Kahn, J.O. et al., 1990, Ann.
Int. Med. 112:254-261; Yarchoan, R. et al., 1989,
Proc. Vth Int. Conf. on AIDS, p. 564, MCP 137).

The late stages of HIV replication, which involve
crucial virus-specific secondary processing of certain
viral proteins, have also been suggested as possible
anti-HIV drug targets. Late stage processing is
dependent on the activity of a viral protease, and
drugs are being developed which inhibit this protease
(Erickson, J., 1990, Science 249:527-533). The
clinical outcome of these candidate drugs is still in
question.

Attention is also being given to the development
of vaccines for the treatment of HIV infection. The
HIV-1 envelope proteins (gp160, gp120, gp41) have
been shown to be the major antigens for anti-HIV
antibodies present in AIDS patients (Barin, et al.,
1985, Science 228:1094-1096). Thus far, therefore,
these proteins seem to be the most promising
candidates to act as antigens for anti-HIV vaccine
development. To this end, several groups have begun
to use various portions of gp160, gp120, and/or gp41
as immunogenic targets for the host immune system.
See for example, Ivanoff, L. et al., U.S. Pat. No.
5,141,867; Saith, G. et al., WO 92/22,654; Shafferman,
A., WO 91/09,872; Formoso, C. et al., WO 90/07,119.
Clinical results concerning these candidate vaccines,
however, still remain far in the future.

Thus, although a great deal of effort is being
directed to the design and testing of anti-retroviral
drugs, a truly effective, non-toxic treatment is still
needed.



3. SUMMARY OF THE INVENTION
The present invention relates, first, to DP178, a
36-amino acid synthetic peptide, also referred to
herein as T20, corresponding to amino acids 638 to 673
of the transmembrane protein (TM) gp41 from the HIV-1
isolate LAI (HIV-1 LAI), which exhibits potent
anti-HIV-1 activity. The gp41 region from which DP178 is
derived is referred to herein as HR2.

The invention further relates to those portions
and analogs of DP178 which also show such antiviral
activity, and/or show anti-membrane fusion capability,
or an ability to modulate intracellular processes
involving coiled-coil peptide structures. The term
"DP178 analog" refers to a peptide which contains an
amino acid sequence corresponding to the DP178 peptide
sequence present within the gp41 protein of HIV-1 LAI,
but found in viruses and/or organisms other than
HIV-1 LAI. Such DP178 analog peptides may, therefore,
correspond to DP178-like amino acid sequences present
in other viruses, such as, for example, enveloped
viruses, such as retroviruses other than HIV-1 LAI, as
well as non-enveloped viruses. Further, such
analogous DP178 peptides may also correspond to
DP178-like amino acid sequences present in nonviral
organisms.

The invention further relates to DP107, a
peptide, which is also referred to herein as T21,
corresponding to amino acids 558-595 of the HIV-1 LAI
transmembrane protein (TM) gp41. The gp41 region from
which DP107 is derived is referred to herein as HR1.
The invention also relates to those portions and
analogs of DP107 that also show antiviral
activity, and/or show anti-membrane fusion capability,
or an ability to modulate intracellular processes
involving coiled-coil peptide structures. The term
"DP107 analog" as used herein refers to a peptide
which contains an amino acid sequence corresponding to
the DP107 peptide sequence present within the gp41
protein of HIV-1 LAI, but found in viruses and organisms
other than HIV-1 LAI. Such DP107 analog peptides may,
therefore, correspond to DP107-like amino acid
sequences present in other viruses, such as, for
example, enveloped viruses, such as retroviruses other
than HIV-1 LAI, as well as non-enveloped viruses.
Further, such DP107 analog peptides may also
correspond to DP107-like amino acid sequences present
in nonviral organisms.
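
By way of illustration only, the residue numbering given above for
DP178 (amino acids 638 to 673) and DP107 (amino acids 558 to 595) can
be checked with a short sketch. It assumes 1-based, inclusive
numbering; the Region type, its method names, and the toy sequence
are hypothetical helpers and are not part of the disclosure.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A 1-based, inclusive residue range in a parent protein (hypothetical helper)."""
    name: str
    start: int
    end: int

    def length(self) -> int:
        # Inclusive numbering: residues start..end span end - start + 1 positions.
        return self.end - self.start + 1

    def extract(self, protein: str) -> str:
        # Convert 1-based inclusive coordinates into a Python slice.
        if self.start < 1 or self.end > len(protein) or self.start > self.end:
            raise ValueError(
                f"{self.name}: range {self.start}-{self.end} is outside the sequence"
            )
        return protein[self.start - 1:self.end]


# Residue ranges as stated in the text.
DP178 = Region("DP178", 638, 673)
DP107 = Region("DP107", 558, 595)

if __name__ == "__main__":
    for region in (DP107, DP178):
        print(region.name, region.length())
    # Toy sequence (not gp41): residues 3 to 5 of "ACDEFG".
    print(Region("toy", 3, 5).extract("ACDEFG"))
```

Under these assumptions the 638 to 673 span corresponds to the 36
residues stated for DP178, and the 558 to 595 span corresponds to 38
residues for DP107.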

Further, the peptides of the invention include
DP107 analog and DP178 analog peptides having amino
acid sequences recognized or identified by the
107x178x4, ALLMOTI5 and/or PLZIP search motifs
described herein.

The peptides of the invention may, for example,
exhibit antifusogenic activity, antiviral activity,
and/or may have the ability to modulate intracellular
processes which involve coiled-coil peptide
structures. With respect to the antiviral activity of
the peptides of the invention, such an antiviral
activity includes, but is not limited to, the
inhibition of HIV transmission to uninfected CD-4+
cells. Additionally, the antifusogenic capability,
antiviral activity or intracellular modulatory
activity of the peptides of the invention merely
requires the presence of the peptides of the
invention, and, specifically, does not require the
stimulation of a host immune response directed against
such peptides.

The peptides of the invention may be used, for
example, as inhibitors of membrane fusion-associated
events, such as, for example, the inhibition of human
and non-human retroviral, especially HIV, transmission
to uninfected cells. It is further contemplated that
the peptides of the invention may be used as
modulators of intracellular events involving
coiled-coil peptide structures.

The peptides of the invention may, alternatively,
be used to identify compounds, including small
molecule compounds, which may themselves exhibit
antifusogenic, antiviral, or intracellular modulatory
activity. For example, in one embodiment, the
peptides of the invention are used to identify other
DP178-like and/or DP107-like peptides that interact
with each other and/or with their complementary HR1 or
HR2 domains with a lower affinity than the affinity
exhibited by the "parent" or "native" DP178 or DP107
peptides from which they are derived. Such DP178-like
and DP107-like peptides, which are also part of the
present invention, may also be used, e.g., to identify
compounds, such as small molecule compounds, that
exhibit antifusogenic, antiviral, or intracellular
modulatory activity.

Additional uses include, for example, the use of
the peptides of the invention as organism or viral
type and/or subtype-specific diagnostic tools.

The terms "antifusogenic" and "anti-membrane
fusion", as used herein, refer to an agent's ability
to inhibit or reduce the level of membrane fusion
events between two or more moieties relative to the
level of membrane fusion which occurs between said
moieties in the absence of the peptide. The moieties
may be, for example, cell membranes or viral
structures, such as viral envelopes or pili. The term
"antiviral", as used herein, refers to the compound's
ability to inhibit viral infection of cells, via, for
example, cell-cell fusion or free virus infection.
Such infection may involve membrane fusion, as occurs
in the case of enveloped viruses, or some other fusion
event involving a viral structure and a cellular
structure (e.g., such as the fusion of a viral pilus
and bacterial membrane during bacterial conjugation).
It is also contemplated that the peptides of the
invention may exhibit the ability to modulate
intracellular events involving coiled-coil peptide
structures. "Modulate", as used herein, refers to a
stimulatory or inhibitory effect on the intracellular
process of interest relative to the level or activity
of such a process in the absence of a peptide of the
invention.

Embodiments of the invention are demonstrated
below wherein an extremely low concentration of DP178
(SEQ ID:1), and very low concentrations of a DP178
homolog (SEQ ID:3) are shown to be potent inhibitors
of HIV-1 mediated CD-4+ cell-cell fusion (i.e.,
syncytial formation) and infection of CD-4+ cells by
cell-free virus. Further, it is shown that DP178 (SEQ
ID:1) is not toxic to cells, even at concentrations 3
logs higher than the inhibitory DP178 (SEQ ID:1)
concentration.

The present invention is based, in part, on the
surprising discovery that the DP107 and DP178 domains
of the HIV gp41 protein non-covalently complex with
each other, and that their interaction is required for
the normal infectivity of the virus. This discovery
is described in the Example presented, below, in
Section 8. The invention, therefore, further relates
to methods for identifying antifusogenic, including
antiviral, compounds that disrupt the interaction
between DP107 and DP178, and/or between DP107-like and
DP178-like peptides.

Additional embodiments of the invention
(specifically, the Examples presented in Sections 9-16
and 19-25, below) are demonstrated, below, wherein
peptides, from a variety of viral and nonviral
sources, having structural and/or amino acid motif
similarity to DP107 and DP178 are identified, and
search motifs for their identification are described.
Further, Examples (in Sections 17, 18, 25-29) are
presented wherein a number of the peptides of the
invention are demonstrated to exhibit substantial
antiviral activity or activity predictive of antiviral
activity.

3.1. DEFINITIONS 

Peptides are defined herein as organic compounds 
comprising two or more amino acids covalently joined 
by peptide bonds. Peptides may be referred to with 
respect to the number of constituent amino acids, 
i.e., a dipeptide contains two amino acid residues, a
tripeptide contains three, etc. Peptides containing 
ten or fewer amino acids may be referred to as 
oligopeptides, while those with more than ten amino 
acid residues are polypeptides. Such peptides may 
also include any of the modifications and additional 
amino and carboxy groups as are described herein. 

Peptide sequences defined herein are represented
by one-letter symbols for amino acid residues as
follows:

A (alanine)
R (arginine)
N (asparagine)
D (aspartic acid)
C (cysteine)
Q (glutamine)
E (glutamic acid)
G (glycine)
H (histidine)
I (isoleucine)
L (leucine)
K (lysine)
M (methionine)
F (phenylalanine)
P (proline)
S (serine)
T (threonine)
W (tryptophan)
Y (tyrosine)
V (valine)
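
For illustration, the one-letter code and the size-based peptide
terms defined in this section may be expressed as a minimal sketch.
The function names are hypothetical, the example sequences are
arbitrary, and the ten-residue boundary between oligopeptides and
polypeptides is taken directly from the definition above.

```python
# One-letter amino acid symbols as tabulated in Section 3.1.
ONE_LETTER_CODES = {
    "A": "alanine", "R": "arginine", "N": "asparagine",
    "D": "aspartic acid", "C": "cysteine", "Q": "glutamine",
    "E": "glutamic acid", "G": "glycine", "H": "histidine",
    "I": "isoleucine", "L": "leucine", "K": "lysine",
    "M": "methionine", "F": "phenylalanine", "P": "proline",
    "S": "serine", "T": "threonine", "W": "tryptophan",
    "Y": "tyrosine", "V": "valine",
}


def expand_sequence(sequence: str) -> list[str]:
    """Translate a one-letter sequence into residue names (hypothetical helper)."""
    names = []
    for position, symbol in enumerate(sequence.upper(), start=1):
        if symbol not in ONE_LETTER_CODES:
            raise ValueError(f"unknown residue symbol {symbol!r} at position {position}")
        names.append(ONE_LETTER_CODES[symbol])
    return names


def classify_peptide(sequence: str) -> str:
    """Apply the size terms of Section 3.1: ten or fewer residues is an
    oligopeptide, more than ten is a polypeptide."""
    count = len(expand_sequence(sequence))
    if count < 2:
        raise ValueError("a peptide comprises two or more amino acids")
    return "oligopeptide" if count <= 10 else "polypeptide"


if __name__ == "__main__":
    print(expand_sequence("GAV"))
    print(classify_peptide("GAV"))
```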

4. BRIEF DESCRIPTION OF THE FIGURES
FIG. 1. Amino acid sequence of DP178 (SEQ ID:1)
derived from HIV-1 LAI; DP178 homologs derived from
HIV-1 SF2 (DP-185; SEQ ID:3), HIV-1 RF (SEQ ID:4), and
HIV-1 MN (SEQ ID:5); DP178 homologs derived from amino
acid sequences of two prototypic HIV-2 isolates, namely,
HIV-2 ROD (SEQ ID:6) and HIV-2 NIHZ (SEQ ID:7); control
peptides: DP-180 (SEQ ID:2), a peptide incorporating
the amino acid residues of DP178 in a scrambled
sequence; DP-118 (SEQ ID:10), unrelated to DP178, which
inhibits HIV-1 cell free virus infection; DP-125 (SEQ
ID:8), unrelated to DP178, also inhibits HIV-1 cell
free virus infection; DP-116 (SEQ ID:9), unrelated to
DP178, is negative for inhibition of HIV-1 infection
when tested using a cell-free virus infection assay.
Throughout the figures, the one letter amino acid code
is used.

FIG. 2. Inhibition of HIV-1 cell-free virus
infection by synthetic peptides. IC50 refers to the
concentration of peptide that inhibits RT production
from infected cells by 50% compared to the untreated
control. Control: the level of RT produced by
untreated cell cultures infected with the same level
of virus as treated cultures.
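
The figure legend defines IC50 but does not state how the value was
computed from the assay data. The following is therefore only a
sketch under stated assumptions: RT production is expressed as
percent inhibition relative to the untreated control, and the IC50
is estimated by linear interpolation on a log10 concentration scale
between the two tested concentrations that bracket 50% inhibition.
The function names and the numeric values in the usage example are
hypothetical and are not data from the Examples.

```python
import math
from typing import Optional, Sequence


def percent_inhibition(rt_treated: float, rt_control: float) -> float:
    """Inhibition of RT production relative to the untreated control, in percent."""
    if rt_control <= 0:
        raise ValueError("control RT level must be positive")
    return 100.0 * (1.0 - rt_treated / rt_control)


def estimate_ic50(
    concentrations: Sequence[float],
    rt_values: Sequence[float],
    rt_control: float,
) -> Optional[float]:
    """Estimate the concentration giving 50% inhibition.

    Assumption (not from the source): log-linear interpolation between
    the two neighbouring doses that bracket 50% inhibition. Returns None
    when the tested range never crosses 50%.
    """
    if len(concentrations) != len(rt_values):
        raise ValueError("one RT value is required per concentration")
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive for a log scale")
    points = sorted(zip(concentrations, rt_values))
    inhibition = [(c, percent_inhibition(rt, rt_control)) for c, rt in points]
    for (c_lo, i_lo), (c_hi, i_hi) in zip(inhibition, inhibition[1:]):
        if i_lo == 50.0:
            return c_lo
        if i_lo < 50.0 <= i_hi:
            fraction = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


if __name__ == "__main__":
    # Hypothetical dose-response values, for illustration only.
    doses = [1.0, 100.0]     # peptide concentration, arbitrary units
    rt = [750.0, 250.0]      # RT signal in treated cultures
    print(estimate_ic50(doses, rt, rt_control=1000.0))
```

Interpolating on a log scale is a common convention for dilution
series spaced by constant factors; a fitted sigmoidal curve would be
an equally reasonable choice.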

FIG. 3. Inhibition of HIV-1 and HIV-2 cell-free
virus infection by the synthetic peptide DP178 (SEQ
ID:1). IC50: concentration of peptide that inhibits
RT production by 50% compared to the untreated
control. Control: Level of RT produced by untreated
cell cultures infected with the same level of virus as
treated cultures.

FIG. 4A-4B. Fusion Inhibition Assays. FIG. 4A:
DP178 (SEQ ID:1) inhibition of HIV-1 prototypic
isolate-mediated syncytial formation; data represents
the number of virus-induced syncytia per well. FIG.
4B: DP-180 (SEQ ID:2) represents a scrambled control
peptide; DP-185 (SEQ ID:3) represents a DP178 homolog
derived from HIV-1 SF2 isolate; Control refers to the
number of syncytia produced in the absence of
peptide.

FIG. 5. Fusion inhibition assay: HIV-1 vs.
HIV-2. Data represents the number of virus-induced
syncytia per well. ND: not done.

FIG. 6. Cytotoxicity study of DP178 (SEQ ID:1) 
and DP-116 (SEQ ID:9) on CEM cells. Cell 
proliferation data is shown. 

FIG. 7. Schematic representation of HIV-gp41
and maltose binding protein (MBP)-gp41 fusion
proteins. DP107 and DP178 are synthetic peptides
based on the two putative helices of gp41. The letter
P in the DP107 boxes denotes an Ile to Pro mutation at
amino acid number 578. Amino acid residues are
numbered according to Meyers et al., "Human
Retroviruses and AIDS", 1991, Theoret. Biol. and
Biophys. Group, Los Alamos Natl. Lab., Los Alamos, NM.
The proteins are more fully described, below, in
Section 8.1.1.

FIG. 8. A point mutation alters the 
conformation and anti-HIV activity of M41. 

FIG. 9. Abrogation of DP178 anti-HIV activity.
Cell fusion assays were carried out in the presence of
10 nM DP178 and various concentrations of M41Δ178 or
M41PΔ178.

FIG. 10. Binding of DP178 to leucine zipper of 
gp41 analyzed by FAb-D ELISA. 

FIG. 11A-B. Models for a structural transition
in the HIV-1 TM protein. Two models are proposed
which indicate a structural transition from a native
oligomer to a fusogenic state following a trigger
event (possibly gp120 binding to CD4). Common
features of both models include (1) the native state
is held together by noncovalent protein-protein
interactions to form the heterodimer of gp120/41 and
other interactions, principally through gp41
interactive sites, to form homo-oligomers on the virus
surface of the gp120/41 complexes; (2) shielding of
the hydrophobic fusogenic peptide at the N-terminus
(F) in the native state; and (3) the leucine zipper
domain (DP107) exists as a homo-oligomer coiled coil
only in the fusogenic state. The major differences in
the two models include the structural state (native or
fusogenic) in which the DP107 and DP178 domains are
complexed to each other. In the first model (FIG.
11A) this interaction occurs in the native state and
in the second (FIG. 11B), it occurs during the
fusogenic state. When triggered, the fusion complex
in the model depicted in (A) is generated through
formation of coiled-coil interactions in homologous
DP107 domains resulting in an extended α-helix. This
conformational change positions the fusion peptide for
interaction with the cell membrane. In the second
model (FIG. 11B), the fusogenic complex is stabilized
by the association of the DP178 domain with the DP107
coiled-coil.

FIG. 12. Motif design using heptad repeat 
positioning of amino acids of known coiled-coils. 

FIG. 13. Motif design using proposed heptad 
repeat positioning of amino acids of DP107 and DP178. 

FIG. 14. Hybrid motif design crossing GCN4
and DP107.

FIG. 15. Hybrid motif design crossing GCN4
and DP178.

FIG. 16. Hybrid motif design 107x178x4
crossing DP107 and DP178. This motif was found to be
the most consistent at identifying relevant DP107-like
and DP178-like peptide regions.

FIG. 17. Hybrid motif design crossing GCN4,
DP107, and DP178.

FIG. 18. Hybrid motif design ALLMOTI5
crossing GCN4, DP107, DP178, c-Fos, c-Jun, c-Myc, and
Flu Loop 36.

FIG. 19. PLZIP motifs designed to identify
N-terminal proline-leucine zipper motifs.

FIG. 20. Search results for HIV-1 (BRU
isolate) envelope protein gp41. Sequence search
motif designations: Spades (♠): 107x178x4; Hearts (♥):
ALLMOTI5; Clubs (♣): PLZIP; Diamonds (♦):
transmembrane region (the putative transmembrane
domains were identified using a PC/Gene program
designed to search for such peptide regions).
Asterisk (*): Lupas method. The amino acid sequences
identified by each motif are bracketed by the
respective characters. Representative sequences
chosen based on 107x178x4 searches are underlined and
in bold. DP107 and DP178 sequences are marked, and
additionally double-underlined and italicized.
FIG. 21. Search results for human
respiratory syncytial virus (RSV) strain A2 fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 22. Search results for simian
immunodeficiency virus (SIV) envelope protein gp41
(AGM3 isolate). Sequence search motif designations
are as in FIG. 20.

FIG. 23. Search results for canine
distemper virus (strain Onderstepoort) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 24. Search results for Newcastle
disease virus (strain Australia-Victoria/32) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 25. Search results for human
parainfluenza 3 virus (strain NIH 47885) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 26. Search results for influenza A
virus (strain A/AICHI/2/68) hemagglutinin precursor
HA2. Sequence search designations are as in FIG. 20.

FIG. 27A-D. Respiratory Syncytial Virus
(RSV) peptide antiviral and circular dichroism data.
FIG. 27A-B: Peptides derived from the F2 DP178/DP107-
like region. Antiviral and CD data. FIG. 27C-D:
Peptides derived from the F1 DP107-like region.
Peptide and CD data.

Antiviral activity (AV) is represented by the
following qualitative symbols:
"-", negative antiviral activity;
"+/-", antiviral activity at greater than 100 µg/ml;
"+", antiviral activity at between 50-100 µg/ml;
"++", antiviral activity at between 20-50 µg/ml;
"+++", antiviral activity at between 1-20 µg/ml;
"++++", antiviral activity at <1 µg/ml.

CD data, referring to the level of helicity, is
represented by the following qualitative symbols:
"-", no helicity;
"+", 25-50% helicity;
"++", 50-75% helicity;
"+++", 75-100% helicity.

IC50 refers to the concentration of peptide
necessary to produce only 50% of the number of
syncytia relative to infected control cultures
containing no peptide. IC50 values were obtained using
purified peptides only.
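
The qualitative antiviral and helicity scales in this legend amount
to simple threshold lookups, sketched below for illustration. Several
points are assumptions rather than statements of the source: the
function names are hypothetical; "-" is used as the symbol for
negative antiviral activity and for no helicity; a value falling
exactly on a shared boundary (for example 50 µg/ml) is assigned to
the more active category; and helicity below 25%, which the legend
does not assign to any symbol, is reported as "-".

```python
from typing import Optional


def antiviral_symbol(active_conc_ug_per_ml: Optional[float]) -> str:
    """Map the concentration (µg/ml) at which antiviral activity is seen
    to the qualitative AV symbol. None means no activity was observed."""
    if active_conc_ug_per_ml is None:
        return "-"
    conc = active_conc_ug_per_ml
    if conc < 0:
        raise ValueError("concentration cannot be negative")
    if conc < 1:
        return "++++"
    if conc <= 20:      # assumed: boundary values go to the more active bin
        return "+++"
    if conc <= 50:
        return "++"
    if conc <= 100:
        return "+"
    return "+/-"


def helicity_symbol(percent_helicity: float) -> str:
    """Map percent helicity from CD data to the qualitative CD symbol."""
    if not 0 <= percent_helicity <= 100:
        raise ValueError("helicity must lie between 0 and 100 percent")
    if percent_helicity >= 75:
        return "+++"
    if percent_helicity >= 50:
        return "++"
    if percent_helicity >= 25:
        return "+"
    return "-"          # assumed: the legend leaves 0-25% unassigned


if __name__ == "__main__":
    for conc in (0.5, 10, 35, 75, 150, None):
        print(conc, antiviral_symbol(conc))
    for helicity in (10, 30, 60, 80):
        print(helicity, helicity_symbol(helicity))
```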

FIG. 28A-B. Respiratory Syncytial Virus
(RSV) DP178-like region (F1) peptide antiviral and CD
data. Antiviral symbols, CD symbols, and IC50 are as
in FIG. 27A-D. IC50 values were obtained using
purified peptides only.

FIG. 29A-B. Peptides derived from the HPIV3
F1 DP107-like region. Peptide antiviral and CD data.
Antiviral symbols, CD symbols, and IC50 are as in FIG.
27A-D. Purified peptides were used to obtain IC50
values, except where the values are marked by an
asterisk (*), in which cases, the IC50 values were
obtained using a crude peptide preparation.

FIG. 29C. HPIV3 peptide T-184 CD spectrum
at 1°C in 0.1 M NaCl, 10 mM KPO4, pH 7.0. The data
demonstrates the peptide's helical secondary structure
(θ222/208 = 1.2) over a wide range of concentrations
(100-1500 µM). This evidence is consistent with the
peptide forming a helical coiled-coil structure.

FIG. 30A-B. Peptides derived from the HPIV3
F1 DP178-like region. Peptide antiviral and CD data.
Antiviral symbols, CD symbols, and IC50 are as in FIG.
27A-D. Purified peptides were used to obtain IC50
values, except where the values are marked by an
asterisk (*), in which cases, the IC50 values were
obtained using a crude peptide preparation.

FIG. 31. Motif search results for simian
immunodeficiency virus (SIV) isolate MM251, envelope
polyprotein gp41. Sequence search designations are as
in FIG. 20.

FIG. 32. Motif search results for
Epstein-Barr Virus (Strain B95-8), glycoprotein gp110
precursor (designated gp115). BALF4. Sequence search
designations are as in FIG. 20.

FIG. 33. Motif search results for
Epstein-Barr Virus (Strain B95-8), BZLF1 trans-activator
protein (designated EB1 or Zebra). Sequence search
designations are as in FIG. 20. Additionally, "@"
refers to a well known DNA binding domain and 
refers to a well known dimerization domain, as defined 
by Flemington and Speck (Flemington, E. and Speck, 
S.H., 1990, Proc. Natl. Acad. Sci. USA 87:9459-9463). 

FIG. 34. Motif search results for measles
virus (strain Edmonston), fusion glycoprotein F1.
Sequence search designations are as in FIG. 20.

FIG. 35. Motif search results for Hepatitis 
B Virus (Subtype AYW) , major surface antigen precursor 
S. Sequence search designations are as in FIG. 20. 

FIG. 36. Motif search results for simian
Mason-Pfizer monkey virus, envelope (TM) protein
gp20. Sequence search designations are as in FIG. 20.

FIG. 37. Motif search results for
Pseudomonas aeruginosa, fimbrial protein (Pilin).
Sequence search designations are as in FIG. 20.

FIG. 38. Motif search results for Neisseria 
gonorrhoeae fimbrial protein (Pilin) . Sequence search 
designations are as in FIG. 20. 

FIG. 39. Motif search results for
Hemophilus influenzae fimbrial protein. Sequence
search designations are as in FIG. 20.

FIG. 40. Motif search results for
Staphylococcus aureus, toxic shock syndrome toxin-1.
Sequence search designations are as in FIG. 20.

FIG. 41. Motif search results for 
Staphylococcus aureus enterotoxin Type E. Sequence 
search designations are as in FIG. 20. 

FIG. 42. Motif search results for 
Staphylococcus aureus enterotoxin A. Sequence search 
designations are as in FIG. 20. 
FIG. 43. Motif search results for
Escherichia coli, heat labile enterotoxin A. Sequence
search designations are as in FIG. 20.

FIG. 44. Motif search results for human c- 
fos proto-oncoprotein. Sequence search designations 
are as in FIG. 20. 



FIG. 45. Motif search results for human
lupus KU autoantigen protein P70. Sequence search
designations are as in FIG. 20.

FIG. 46. Motif search results for human
zinc finger protein 10. Sequence search designations
are as in FIG. 20.

FIG. 47. Measles virus (MeV) fusion protein
DP178-like region antiviral and CD data. Antiviral
symbols, CD symbols, and IC50 are as in FIG. 27A-D.
IC50 values were obtained using purified peptides.

FIG. 48. Simian immunodeficiency virus
(SIV) TM (fusion) protein DP178-like region antiviral
data. Antiviral symbols are as in FIG. 27A-D. "NT",
not tested.

FIG. 49A-C. DP178-derived peptide antiviral
data. The peptides listed herein were derived from
the region surrounding the HIV-1 BRU isolate DP178
region (e.g., gp41 amino acid residues 615-717).
In instances where peptides contained DP178 point
mutations, the mutated amino acid residues are shown
with a shaded background. In instances in which the
test peptide has had an amino and/or carboxy-terminal
group added or removed (apart from the standard amido-
and acetyl- blocking groups found on such peptides),
such modifications are indicated. FIG. 49A: The
column to the immediate right of the name of the test
peptide indicates the size of the test peptide and
points out whether the peptide is derived from a one
amino acid peptide "walk" across the DP178 region.
The next column to the right indicates whether the
test peptide contains a point mutation, while the
column to its right indicates whether certain amino
acid residues have been added to or removed from the
DP178-derived amino acid sequence. FIG. 49B: The
column to the immediate right of the test peptide name
indicates whether the peptide represents a DP178
truncation, the next column to the right points out
whether the peptide contains a point mutation, and the
column to its right indicates whether the peptide
contains amino acids which have been added to or
removed from the DP178 sequence itself. FIG. 49C:
The column to the immediate right of the test peptide
name indicates whether the test peptide contains a
point mutation, while the column to its right
indicates whether amino acid residues have been added
to or removed from the DP178 sequence itself. IC50 is
as defined in FIG. 27A-D, and IC50 values were obtained
using purified peptides except where marked with an
asterisk (*), in which case the IC50 was obtained using
a crude peptide preparation.

FIG. 50. DP107 and DP107 gp41 region
truncated peptide antiviral data. IC50 is as defined in
FIG. 27A-D, and IC50 values were obtained using
purified peptides except where marked with an asterisk
(*), in which case the IC50 was obtained using a crude
peptide preparation.

FIG. 51A-B. Epstein-Barr virus Strain B95-8
BZLF1 DP178/DP107 analog region peptide walks and
electrophoretic mobility shift assay results. The
peptides (T-423 to T-446, FIG. 51A; T-447 to T-461,
FIG. 51B) represent one amino acid residue "walks"
through the EBV Zebra protein region from amino acid
residue 173 to 246.

The amino acid residue within this region which
corresponds to the first amino acid residue of each
peptide is listed to the left of each peptide, while
the amino acid residue within this region which
corresponds to the last amino acid residue of each
peptide is listed to the right of each peptide. The
length of each test peptide is listed at the far right
of each line, under the heading "Res".

"ACT" refers to a test peptide's ability to
inhibit Zebra binding to its response element. "+"
refers to a visible, but incomplete, abrogation of the
response element/Zebra homodimer complex; "+++" refers
to a complete abrogation of the complex; and "-"
represents a lack of complex disruption.

FIG. 52A-B. Hepatitis B virus subtype AYW major
surface antigen precursor S protein DP178/DP107 analog
region and peptide walks. FIG. 52A depicts Domain I (S
protein amino acid residues 174-220), which contains a
potential DP178/DP107 analog region. In addition,
peptides are listed which represent one amino acid
peptide "walks" through Domain I. FIG. 52B depicts Domain
II (S protein amino acid residues 233-291), which
contains a second potential DP178/DP107 analog region.
In addition, peptides are listed which represent one
amino acid peptide "walks" through Domain II.

FIG. 53: Cell fusion and competitive inhibition
data for alanine walk experiments for the DP178-like
Respiratory Syncytial Virus (RSV) peptide T112.

FIG. 54: Circular dichroism, cell fusion and
competitive inhibition data for alanine walk
experiments for the peptide T20, which is also known
as DP178.

5. DETAILED DESCRIPTION OF THE INVENTION
Described herein are peptides which may exhibit
antifusogenic activity, antiviral capability, and/or
the ability to modulate intracellular processes
involving coiled-coil peptide structures. The
peptides described include, first, DP178 (SEQ ID
NO:1), a gp41-derived 36 amino acid peptide, and
fragments and analogs of DP178.

In addition, the peptides of the invention
described herein include peptides which are DP107
analogs. DP107 (SEQ ID NO:25) is a 38 amino acid
peptide corresponding to residues 558 to 595 of the
HIV-1 LAI transmembrane (TM) gp41 protein. Such DP107
analogs may exhibit antifusogenic capability,
antiviral activity or an ability to modulate
intracellular processes involving coiled-coil
structures.

Further, the peptides of the invention include DP107
and DP178 analogs, described herein, having amino acid
sequences recognized by the 107x178x4, ALLMOTI5, and
PLZIP search motifs. Such motifs are also discussed.

Also described here are antifusogenic, antiviral, 
intracellular modulatory, and diagnostic uses of the 
peptides of the invention. Further, procedures are 
described for the use of the peptides of the invention 
for the identification of compounds exhibiting 
antifusogenic, antiviral or intracellular modulatory 
activity. 

While not limited to any theory of operation, the
following model is proposed to explain the potent
anti-HIV activity of DP178, based, in part, on the
experiments described in the Examples, infra. In the
HIV protein gp41, DP178 corresponds to a putative
α-helix region located in the C-terminal end of the gp41
ectodomain, and appears to associate with a distal
site on gp41 whose interactive structure is influenced
by the leucine zipper motif, a coiled-coil structure,
referred to as DP107. The association of these two
domains may reflect a molecular linkage or "molecular
clasp" intimately involved in the fusion process. It
is of interest that mutations in the C-terminal
α-helix motif of gp41 (i.e., the DP178 domain) tend to
enhance the fusion ability of gp41, whereas mutations
in the leucine zipper region (i.e., the DP107 domain)
decrease or abolish the fusion ability of the viral
protein. It may be that the leucine zipper motif is
involved in membrane fusion while the C-terminal
α-helix motif serves as a molecular safety to regulate
the availability of the leucine zipper during
virus-induced membrane fusion.

On the basis of the foregoing, two models of
gp41-mediated membrane fusion are proposed, which are
schematically shown in FIG. 11A-B. The reason for
proposing two models is that the temporal nature of
the interaction between the regions defined by DP107
and DP178 cannot, as yet, be pinpointed. Each model
envisions two conformations for gp41: one in a
"native" state, as it might be found on a resting
virion, and the other in a "fusogenic" state, to reflect
conformational changes triggered following binding of
gp120 to CD4 and just prior to fusion with the target
cell membrane. The strong binding affinity between
gp120 and CD4 may actually represent the trigger for
the fusion process, obviating the need for a pH change
such as occurs for viruses that fuse within
intracellular vesicles. The two major features of
both models are: (1) the leucine zipper sequences
(DP107) in each chain of oligomeric envelope are held
apart in the native state and are only allowed access
to one another in the fusogenic state so as to form
the extremely stable coiled-coils, and (2) association
of the DP178 and DP107 sites as they exist in gp41
occurs either in the native or fusogenic state. FIG.
11A depicts DP178/DP107 interaction in the native
state as a molecular clasp. On the other hand, if one
assumes that the most stable form of the envelope
occurs in the fusogenic state, the model in FIG. 11B
can be considered.

When synthesized as peptides, both DP107 and
DP178 are potent inhibitors of HIV infection and
fusion, probably by virtue of their ability to form
complexes with viral gp41 and interfere with its
fusogenic process; e.g., during the structural
transition of the viral protein from the native
structure to the fusogenic state, the DP178 and DP107
peptides may gain access to their respective binding
sites on the viral gp41, and exert a disruptive
influence. DP107 peptides which demonstrate anti-HIV
activity are described in Applicants' co-pending
application Serial No. 08/264,531, filed June 23,
1994, which is incorporated by reference herein in its
entirety.

As shown in the Examples, infra, a truncated
recombinant gp41 protein corresponding to the
ectodomain of gp41 containing both DP107 and DP178
domains (excluding the fusion peptide, transmembrane
region and cytoplasmic domain of gp41) did not inhibit
HIV-1 induced fusion. However, when a single mutation
was introduced to disrupt the coiled-coil structure of
the DP107 domain - a mutation which results in a
total loss of biological activity of DP107 peptides -
the inactive recombinant protein was transformed to an
active inhibitor of HIV-1 induced fusion. This
transformation may result from liberation of the
potent DP178 domain from a molecular clasp with the
leucine zipper, DP107 domain.

For clarity of discussion, the invention will be
described primarily for DP178 peptide inhibitors of
HIV. However, the principles may be analogously
applied to other viruses, both enveloped and
nonenveloped, and to other non-viral organisms.

5.1. DP178 AND DP178-LIKE PEPTIDES

The DP178 peptide (SEQ ID:1) of the invention
corresponds to amino acid residues 638 to 673 of the
transmembrane protein gp41 from the HIV-1 LAI isolate,
and has the 36 amino acid sequence (reading from amino
to carboxy terminus):

NH2-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH (SEQ ID:1)

In addition to the full-length DP178 (SEQ ID:1)
36-mer, the peptides of the invention may include
truncations of the DP178 (SEQ ID:1) peptide which
exhibit antifusogenic activity, antiviral activity
and/or the ability to modulate intracellular processes
involving coiled-coil peptide structures. Truncations
of DP178 (SEQ ID:1) peptides may comprise peptides of
between 3 and 36 amino acid residues (i.e., peptides
ranging in size from a tripeptide to a 36-mer
polypeptide), as shown in Tables I and IA, below.
Peptide sequences in these tables are listed from
amino (left) to carboxy (right) terminus. "X" may
represent an amino group (-NH2) and "Z" may represent a
carboxyl (-COOH) group. Alternatively, "X" may
represent a hydrophobic group, including but not
limited to carbobenzyl, dansyl, or T-butoxycarbonyl;
an acetyl group; a 9-fluorenylmethoxy-carbonyl (FMOC)
group; or a covalently attached macromolecular group,
including but not limited to a lipid-fatty acid
conjugate, polyethylene glycol, carbohydrate or
peptide group. Further, "Z" may represent an amido
group; a T-butoxycarbonyl group; or a covalently
attached macromolecular group, including but not
limited to a lipid-fatty acid conjugate, polyethylene
glycol, carbohydrate or peptide group. A preferred
"X" or "Z" macromolecular group is a peptide group.



TABLE I

DP178 (SEQ ID:1) CARBOXY TRUNCATIONS

X-YTS-Z
X-YTSL-Z
X-YTSLI-Z
X-YTSLIH-Z
X-YTSLIHS-Z
X-YTSLIHSL-Z
X-YTSLIHSLI-Z
X-YTSLIHSLIE-Z
X-YTSLIHSLIEE-Z
X-YTSLIHSLIEES-Z
X-YTSLIHSLIEESQ-Z
X-YTSLIHSLIEESQN-Z
X-YTSLIHSLIEESQNQ-Z
X-YTSLIHSLIEESQNQQ-Z
X-YTSLIHSLIEESQNQQE-Z
X-YTSLIHSLIEESQNQQEK-Z
X-YTSLIHSLIEESQNQQEKN-Z
X-YTSLIHSLIEESQNQQEKNE-Z
X-YTSLIHSLIEESQNQQEKNEQ-Z
X-YTSLIHSLIEESQNQQEKNEQE-Z
X-YTSLIHSLIEESQNQQEKNEQEL-Z
X-YTSLIHSLIEESQNQQEKNEQELL-Z
X-YTSLIHSLIEESQNQQEKNEQELLE-Z
X-YTSLIHSLIEESQNQQEKNEQELLEL-Z
X-YTSLIHSLIEESQNQQEKNEQELLELD-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDK-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWA-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWAS-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z

The one letter amino acid code is used.

TABLE IA

DP178 (SEQ ID:1) AMINO TRUNCATIONS

X-NWF-Z
X-WNWF-Z
X-LWNWF-Z
X-SLWNWF-Z
X-ASLWNWF-Z
X-WASLWNWF-Z
X-KWASLWNWF-Z
X-DKWASLWNWF-Z
X-LDKWASLWNWF-Z
X-ELDKWASLWNWF-Z
X-LELDKWASLWNWF-Z
X-LLELDKWASLWNWF-Z
X-ELLELDKWASLWNWF-Z
X-QELLELDKWASLWNWF-Z
X-EQELLELDKWASLWNWF-Z
X-NEQELLELDKWASLWNWF-Z
X-KNEQELLELDKWASLWNWF-Z
X-EKNEQELLELDKWASLWNWF-Z
X-QEKNEQELLELDKWASLWNWF-Z
X-QQEKNEQELLELDKWASLWNWF-Z
X-NQQEKNEQELLELDKWASLWNWF-Z
X-QNQQEKNEQELLELDKWASLWNWF-Z
X-SQNQQEKNEQELLELDKWASLWNWF-Z
X-ESQNQQEKNEQELLELDKWASLWNWF-Z
X-EESQNQQEKNEQELLELDKWASLWNWF-Z
X-IEESQNQQEKNEQELLELDKWASLWNWF-Z
X-LIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-SLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-HSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-IHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z

The one letter amino acid code is used.
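The truncation series of Tables I and IA follow a simple pattern and can be enumerated mechanically. The following is a minimal illustrative sketch, not part of the original disclosure; the function names (`carboxy_truncations`, `amino_truncations`, `as_table_row`) are hypothetical, and the literal "X" and "Z" placeholders stand for the terminal groups discussed above.

```python
# Illustrative sketch only: enumerates the truncation series of
# Tables I and IA. Function names are hypothetical.

DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1, 36 residues


def carboxy_truncations(seq, min_len=3):
    """Peptides that keep the amino terminus and are shortened
    from the carboxy end (Table I pattern)."""
    return [seq[:n] for n in range(min_len, len(seq) + 1)]


def amino_truncations(seq, min_len=3):
    """Peptides that keep the carboxy terminus and are shortened
    from the amino end (Table IA pattern)."""
    return [seq[-n:] for n in range(min_len, len(seq) + 1)]


def as_table_row(peptide, x="X", z="Z"):
    """Format a peptide with placeholder terminal groups."""
    return f"{x}-{peptide}-{z}"


if __name__ == "__main__":
    for peptide in carboxy_truncations(DP178):
        print(as_table_row(peptide))
    for peptide in amino_truncations(DP178):
        print(as_table_row(peptide))
```

Each series runs from a tripeptide to the full 36-mer, so each list holds 34 entries. The same functions could be applied to any other parent sequence given in one letter code.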

The peptides of the invention also include
DP178-like peptides. "DP178-like", as used herein, refers,
first, to DP178 and DP178 truncations which contain
one or more amino acid substitutions, insertions
and/or deletions. Second, "DP178-like" refers to
peptide sequences identified or recognized by the
ALLMOTI5, 107x178x4 and PLZIP search motifs described
herein, having structural and/or amino acid motif
similarity to DP178. The DP178-like peptides of the
invention may exhibit antifusogenic or antiviral
activity, or may exhibit the ability to modulate
intracellular processes involving coiled-coil
peptides. Further, such DP178-like peptides may
possess additional advantageous features, such as, for
example, increased bioavailability, and/or stability,
or reduced host immune recognition.

HIV-1 and HIV-2 envelope proteins are
structurally distinct, but there exists a striking
amino acid conservation within the DP178-corresponding
regions of HIV-1 and HIV-2. The amino acid
conservation is of a periodic nature, suggesting some
conservation of structure and/or function. Therefore,
one possible class of amino acid substitutions would
include those amino acid changes which are predicted
to stabilize the structure of the DP178 peptides of
the invention. Utilizing the DP178 and DP178 analog
sequences described herein, the skilled artisan can
readily compile DP178 consensus sequences and
ascertain from these conserved amino acid residues
which would represent preferred amino acid
substitutions.

The amino acid substitutions may be of a
conserved or non-conserved nature. Conserved amino
acid substitutions consist of replacing one or more
amino acids of the DP178 (SEQ ID:1) peptide sequence
with amino acids of similar charge, size, and/or
hydrophobicity characteristics, such as, for example,
a glutamic acid (E) to aspartic acid (D) amino acid
substitution. Non-conserved substitutions consist of
replacing one or more amino acids of the DP178 (SEQ
ID:1) peptide sequence with amino acids possessing
dissimilar charge, size, and/or hydrophobicity
characteristics, such as, for example, a glutamic acid
(E) to valine (V) substitution.
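The distinction between conserved and non-conserved substitutions can be illustrated with a short sketch. The grouping of residues into side-chain classes below is a simplifying assumption made only for this illustration (the text above speaks more generally of similar charge, size, and/or hydrophobicity); the names `SIDE_CHAIN_CLASS`, `is_conserved` and `substitute` are hypothetical.

```python
# Illustrative sketch only. The side-chain classes are a coarse,
# assumed grouping and are not taken from the specification.

DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1

SIDE_CHAIN_CLASS = {
    **dict.fromkeys("DE", "acidic"),
    **dict.fromkeys("KRH", "basic"),
    **dict.fromkeys("STNQ", "polar"),
    **dict.fromkeys("AVLIM", "aliphatic"),
    **dict.fromkeys("FWY", "aromatic"),
    **dict.fromkeys("CGP", "special"),
}


def is_conserved(original, replacement):
    """True if both residues fall in the same assumed class.
    Residues in the 'special' class match only themselves."""
    if original == replacement:
        return True
    cls = SIDE_CHAIN_CLASS[original]
    return cls == SIDE_CHAIN_CLASS[replacement] and cls != "special"


def substitute(peptide, position, replacement):
    """Return a copy of peptide with the residue at the given
    1-based position replaced."""
    if not 1 <= position <= len(peptide):
        raise ValueError("position outside peptide")
    return peptide[:position - 1] + replacement + peptide[position:]


if __name__ == "__main__":
    print(is_conserved("E", "D"))      # the E to D example
    print(is_conserved("E", "V"))      # the E to V example
    print(substitute(DP178, 10, "D"))  # E at position 10 replaced by D
```

Under this assumed grouping, the glutamic acid to aspartic acid example is classed as conserved and the glutamic acid to valine example as non-conserved, in line with the two examples in the text. Whether a given substituted peptide retains activity is an experimental question that such a classification does not answer.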

Amino acid insertions may consist of single amino
acid residues or stretches of residues. The
insertions may be made at the carboxy or amino
terminal end of the DP178 or DP178 truncated peptides,
as well as at a position internal to the peptide.
Such insertions will generally range from 2 to 15
amino acids in length. It is contemplated that
insertions made at either the carboxy or amino
terminus of the peptide of interest may be of a
broader size range, with about 2 to about 50 amino
acids being preferred. One or more such insertions
may be introduced into DP178 (SEQ ID:1) or DP178
truncations, as long as such insertions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

Preferred amino or carboxy terminal insertions
are peptides ranging from about 2 to about 50 amino
acid residues in length, corresponding to gp41 protein
regions either amino to or carboxy to the actual DP178
gp41 amino acid sequence, respectively. Thus, a




WO 01/51673 PCT/US00/35727 



preferred amino terminal or carboxy terminal amino 
acid insertion would contain gp41 amino acid sequences 
found immediately amino to or carboxy to the DP178 
region of the gp41 protein. 

Deletions of DP178 (SEQ ID:1) or DP178
truncations are also within the scope of the
invention. Such deletions consist of the removal of
one or more amino acids from the DP178 or DP178-like
peptide sequence, with the lower limit length of the
resulting peptide sequence being 4 to 6 amino acids.
Such deletions may involve a single contiguous or
greater than one discrete portion of the peptide
sequences. One or more such deletions may be
introduced into DP178 (SEQ ID:1) or DP178 truncations,
as long as such deletions result in peptides which may
still be recognized by the 107x178x4, ALLMOTI5 or
PLZIP search motifs described herein, or may,
alternatively, exhibit antifusogenic or antiviral
activity, or exhibit the ability to modulate
intracellular processes involving coiled-coil peptide
structures.


DP178 analogs are further described, below, in 
Section 5.3. 

5.2. DP107 AND DP107-LIKE PEPTIDES
Further, the peptides of the invention include
peptides having amino acid sequences corresponding to
DP107 analogs. DP107 is a 38 amino acid peptide which
exhibits potent antiviral activity, and corresponds to
residues 558 to 595 of HIV-1 LAI transmembrane (TM) gp41
protein, as shown here:

NH2-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-COOH
(SEQ ID:25)



In addition to the full-length DP107 (SEQ ID:25)
38-mer, the peptides of the invention may include
truncations of the DP107 (SEQ ID:25) peptide which
exhibit antifusogenic activity, antiviral activity
and/or the ability to modulate intracellular processes
involving coiled-coil peptide structures. Truncations
of DP107 (SEQ ID:25) peptides may comprise peptides of
between 3 and 38 amino acid residues (i.e., peptides
ranging in size from a tripeptide to a 38-mer
polypeptide), as shown in Tables II and IIA, below.
Peptide sequences in these tables are listed from
amino (left) to carboxy (right) terminus. "X" may
represent an amino group (-NH2) and "Z" may represent a
carboxyl (-COOH) group. Alternatively, "X" may
represent a hydrophobic group, including but not
limited to carbobenzyl, dansyl, or T-butoxycarbonyl;
an acetyl group; a 9-fluorenylmethoxy-carbonyl (FMOC)
group; or a covalently attached macromolecular group,
including but not limited to a lipid-fatty acid
conjugate, polyethylene glycol, carbohydrate or
peptide group. Further, "Z" may represent an amido
group; a T-butoxycarbonyl group; or a covalently
attached macromolecular group, including but not
limited to a lipid-fatty acid conjugate, polyethylene
glycol, carbohydrate or peptide group. A preferred
"X" or "Z" macromolecular group is a peptide group.



TABLE II

DP107 (SEQ ID:25) CARBOXY TRUNCATIONS

X-NNL-Z
X-NNLL-Z
X-NNLLR-Z
X-NNLLRA-Z
X-NNLLRAI-Z
X-NNLLRAIE-Z
X-NNLLRAIEA-Z
X-NNLLRAIEAQ-Z
X-NNLLRAIEAQQ-Z
X-NNLLRAIEAQQH-Z
X-NNLLRAIEAQQHL-Z
X-NNLLRAIEAQQHLL-Z
X-NNLLRAIEAQQHLLQ-Z
X-NNLLRAIEAQQHLLQL-Z
X-NNLLRAIEAQQHLLQLT-Z
X-NNLLRAIEAQQHLLQLTV-Z
X-NNLLRAIEAQQHLLQLTVW-Z
X-NNLLRAIEAQQHLLQLTVWQ-Z
X-NNLLRAIEAQQHLLQLTVWQI-Z
X-NNLLRAIEAQQHLLQLTVWQIK-Z
X-NNLLRAIEAQQHLLQLTVWQIKQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQA-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQAR-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARI-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARIL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILA-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAV-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVE-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVER-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERY-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLK-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKD-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z

The one letter amino acid code is used.

TABLE IIA

DP107 (SEQ ID:25) AMINO TRUNCATIONS

X-KDQ-Z
X-LKDQ-Z
X-YLKDQ-Z
X-RYLKDQ-Z
X-ERYLKDQ-Z
X-VERYLKDQ-Z
X-AVERYLKDQ-Z
X-LAVERYLKDQ-Z
X-ILAVERYLKDQ-Z
X-RILAVERYLKDQ-Z
X-ARILAVERYLKDQ-Z
X-QARILAVERYLKDQ-Z
X-LQARILAVERYLKDQ-Z
X-QLQARILAVERYLKDQ-Z
X-KQLQARILAVERYLKDQ-Z
X-IKQLQARILAVERYLKDQ-Z
X-QIKQLQARILAVERYLKDQ-Z
X-WQIKQLQARILAVERYLKDQ-Z
X-VWQIKQLQARILAVERYLKDQ-Z
X-TVWQIKQLQARILAVERYLKDQ-Z
X-LTVWQIKQLQARILAVERYLKDQ-Z
X-QLTVWQIKQLQARILAVERYLKDQ-Z
X-LQLTVWQIKQLQARILAVERYLKDQ-Z
X-LLQLTVWQIKQLQARILAVERYLKDQ-Z
X-HLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-QHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-QQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-AQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-EAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-IEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-AIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-RAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-LRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-LLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-NLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z

The one letter amino acid code is used.

The peptides of the invention also include
DP107-like peptides. "DP107-like", as used herein, refers,
first, to DP107 and DP107 truncations which contain
one or more amino acid substitutions, insertions
and/or deletions. Second, "DP107-like" refers to
peptide sequences identified or recognized by the
ALLMOTI5, 107x178x4 and PLZIP search motifs described
herein, having structural and/or amino acid motif
similarity to DP107. The DP107-like peptides of the
invention may exhibit antifusogenic or antiviral
activity, or may exhibit the ability to modulate
intracellular processes involving coiled-coil
peptides. Further, such DP107-like peptides may
possess additional advantageous features, such as, for
example, increased bioavailability, and/or stability,
or reduced host immune recognition.

HIV-1 and HIV-2 envelope proteins are
structurally distinct, but there exists a striking
amino acid conservation within the DP107-corresponding
regions of HIV-1 and HIV-2. The amino acid
conservation is of a periodic nature, suggesting some
conservation of structure and/or function. Therefore,
one possible class of amino acid substitutions would
include those amino acid changes which are predicted
to stabilize the structure of the DP107 peptides of
the invention. Utilizing the DP107 and DP107 analog
sequences described herein, the skilled artisan can
readily compile DP107 consensus sequences and
ascertain from these conserved amino acid residues
which would represent preferred amino acid
substitutions.

The amino acid substitutions may be of a
conserved or non-conserved nature. Conserved amino
acid substitutions consist of replacing one or more
amino acids of the DP107 (SEQ ID:25) peptide sequence
with amino acids of similar charge, size, and/or
hydrophobicity characteristics, such as, for example,
a glutamic acid (E) to aspartic acid (D) amino acid
substitution. Non-conserved substitutions consist of
replacing one or more amino acids of the DP107 (SEQ
ID:25) peptide sequence with amino acids possessing
dissimilar charge, size, and/or hydrophobicity
characteristics, such as, for example, a glutamic acid
(E) to valine (V) substitution.

Amino acid insertions may consist of single amino
acid residues or stretches of residues. The
insertions may be made at the carboxy or amino
terminal end of the DP107 or DP107 truncated peptides,
as well as at a position internal to the peptide.
Such insertions will generally range from 2 to 15
amino acids in length. It is contemplated that
insertions made at either the carboxy or amino
terminus of the peptide of interest may be of a
broader size range, with about 2 to about 50 amino
acids being preferred. One or more such insertions
may be introduced into DP107 (SEQ ID:25) or DP107
truncations, as long as such insertions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

Preferred amino or carboxy terminal insertions
are peptides ranging from about 2 to about 50 amino
acid residues in length, corresponding to gp41 protein
regions either amino to or carboxy to the actual DP107
gp41 amino acid sequence, respectively. Thus, a
preferred amino terminal or carboxy terminal amino
acid insertion would contain gp41 amino acid sequences
found immediately amino to or carboxy to the DP107
region of the gp41 protein.

Deletions of DP107 (SEQ ID:25) or DP107
truncations are also within the scope of the
invention. Such deletions consist of the removal of
one or more amino acids from the DP107 or DP107-like
peptide sequence, with the lower limit length of the
resulting peptide sequence being 4 to 6 amino acids.
Such deletions may involve a single contiguous or
greater than one discrete portion of the peptide
sequences. One or more such deletions may be
introduced into DP107 (SEQ ID:25) or DP107
truncations, as long as such deletions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

DP107 and DP107 truncations are more fully
described in Applicants' co-pending U.S. Patent
Application Ser. No. 08/374,666, filed January 27,
1995, which is incorporated herein by reference in
its entirety. DP107 analogs are further described,
below, in Section 5.3.

5.3. DP107 AND DP178 ANALOGS

Peptides corresponding to analogs of the DP178,
DP178 truncations, DP107 and DP107 truncation
sequences of the invention, described, above, in
Sections 5.1 and 5.2 may be found in other viruses,
including, for example, non-HIV-1 LAI enveloped viruses,
non-enveloped viruses and other non-viral organisms.

The term "analog", as used herein, refers to a
peptide which is recognized or identified via the
107x178x4, ALLMOTI5 and/or PLZIP search strategies
discussed below. Further, such peptides may exhibit
antifusogenic capability, antiviral activity, or the
ability to modulate intracellular processes involving
coiled-coil structures.

Such DP178 and DP107 analogs may, for example,
correspond to peptide sequences present in TM proteins
of enveloped viruses and may, additionally, correspond
to peptide sequences present in non-enveloped viruses
and non-viral organisms. Such peptides may exhibit
antifusogenic activity, antiviral activity, most
particularly antiviral activity which is specific to
the virus in which their native sequences are found,
or may exhibit an ability to modulate intracellular
processes involving coiled-coil peptide structures.

DP178 analogs are peptides whose amino acid
sequences are comprised of the amino acid sequences of
peptide regions of, for example, other (i.e., other
than HIV-1 LAI) viruses that correspond to the gp41
peptide region from which DP178 (SEQ ID:1) was
derived. Such viruses may include, but are not
limited to, other HIV-1 isolates and HIV-2 isolates.
DP178 analogs derived from the corresponding gp41
peptide region of other (i.e., non HIV-1 LAI) HIV-1
isolates may include, for example, peptide sequences
as shown below.

NH2-YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF-COOH
(DP-185; SEQ ID:3);

NH2-YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF-COOH
(SEQ ID:4);

NH2-YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF-COOH
(SEQ ID:5).

SEQ ID:3 (DP-185), SEQ ID:4, and SEQ ID:5 are derived
from HIV-1 SF2, HIV-1 RF, and HIV-1 MN isolates,
respectively. Underlined amino acid residues refer to
those residues that differ from the corresponding
position in the DP178 (SEQ ID:1) peptide. One such
DP178 analog, DP-185 (SEQ ID:3), is described in the
Example presented in Section 6, below, where it is
demonstrated that DP-185 (SEQ ID:3) exhibits antiviral
activity. The DP178 analogs of the invention may also
include truncations, as described above. Further, the
analogs of the invention may include modifications such
as those described for DP178 in Section 5.1, above.
It is preferred that the DP178 analogs of the
invention represent peptides whose amino acid
sequences correspond to the DP178 region of the gp41
protein; it is also contemplated that the peptides of
the invention may, additionally, include amino acid
sequences, ranging from about 2 to about 50 amino acid
residues in length, corresponding to gp41 protein
regions either amino to or carboxy to the actual DP178
amino acid sequence.
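Because the underlining that marks differing residues does not survive in plain text, the positions at which an analog departs from DP178 can be recovered by direct comparison. The following is a minimal sketch, not part of the original disclosure; the function name `differing_positions` is hypothetical, the SEQ ID:5 sequence is used as it is printed above, and positions are counted from 1 within the 36-residue peptide rather than in gp41 numbering.

```python
# Illustrative sketch only: lists the positions at which an analog
# of equal length differs from DP178. Names are hypothetical.

DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"     # SEQ ID:1
SEQ_ID_5 = "YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF"  # as printed above


def differing_positions(reference, analog):
    """Return (position, reference residue, analog residue) for every
    mismatch. Positions are 1-based within the peptide."""
    if len(reference) != len(analog):
        raise ValueError("sequences must be the same length")
    return [
        (i, r, a)
        for i, (r, a) in enumerate(zip(reference, analog), start=1)
        if r != a
    ]


if __name__ == "__main__":
    for position, ref, ana in differing_positions(DP178, SEQ_ID_5):
        print(f"position {position}: {ref} -> {ana}")
```

For the two sequences as given here, the comparison reports differences at peptide positions 6, 9, 11 and 14. The comparison assumes ungapped sequences of equal length; analogs containing insertions or deletions would need an alignment step first.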
Striking similarities, as shown in FIG. 1, exist
within the regions of HIV-1 and HIV-2 isolates which
correspond to the DP178 sequence. A DP178 analog
derived from the HIV-2 NIHZ isolate has the 36 amino acid
sequence (reading from amino to carboxy terminus):

NH2-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-COOH (SEQ ID:7)

Table III and Table IV show some possible truncations
of the HIV-2 NIHZ DP178 analog, which may comprise
peptides of between 3 and 36 amino acid residues
(i.e., peptides ranging in size from a tripeptide to a
36-mer polypeptide). Peptide sequences in these
tables are listed from amino (left) to carboxy (right)
terminus. "X" may represent an amino group (-NH2) and
"Z" may represent a carboxyl (-COOH) group.
Alternatively, "X" may represent a hydrophobic group,
including but not limited to carbobenzyl, dansyl, or
T-butoxycarbonyl; an acetyl group; a
9-fluorenylmethoxy-carbonyl (FMOC) group; or a
covalently attached macromolecular group, including
but not limited to a lipid-fatty acid conjugate,
polyethylene glycol, carbohydrate or peptide group.
Further, "Z" may represent an amido group; a
T-butoxycarbonyl group; or a covalently attached
macromolecular group, including but not limited to a
lipid-fatty acid conjugate, polyethylene glycol,
carbohydrate or peptide group. A preferred "X" or "Z"
macromolecular group is a peptide group.






WO 01/51673 



PCT/US00/35727



TABLE III

HIV-2 NIHZ DP178 analog carboxy truncations.

X-LEA-Z
X-LEAN-Z
X-LEANI-Z
X-LEANIS-Z
X-LEANISQ-Z
X-LEANISQS-Z
X-LEANISQSL-Z
X-LEANISQSLE-Z
X-LEANISQSLEQ-Z
X-LEANISQSLEQA-Z
X-LEANISQSLEQAQ-Z
X-LEANISQSLEQAQI-Z
X-LEANISQSLEQAQIQ-Z
X-LEANISQSLEQAQIQQ-Z
X-LEANISQSLEQAQIQQE-Z
X-LEANISQSLEQAQIQQEK-Z
X-LEANISQSLEQAQIQQEKN-Z
X-LEANISQSLEQAQIQQEKNM-Z
X-LEANISQSLEQAQIQQEKNMY-Z
X-LEANISQSLEQAQIQQEKNMYE-Z
X-LEANISQSLEQAQIQQEKNMYEL-Z
X-LEANISQSLEQAQIQQEKNMYELQ-Z
X-LEANISQSLEQAQIQQEKNMYELQK-Z
X-LEANISQSLEQAQIQQEKNMYELQKL-Z
X-LEANISQSLEQAQIQQEKNMYELQKLN-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNS-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSW-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWD-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDV-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVF-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFT-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTN-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z

The one letter amino acid code is used.

TABLE IV

HIV-2 NIHZ DP178 analog amino truncations.

X-NWL-Z
X-TNWL-Z
X-FTNWL-Z
X-VFTNWL-Z
X-DVFTNWL-Z
X-WDVFTNWL-Z
X-SWDVFTNWL-Z
X-NSWDVFTNWL-Z
X-LNSWDVFTNWL-Z
X-KLNSWDVFTNWL-Z
X-QKLNSWDVFTNWL-Z
X-LQKLNSWDVFTNWL-Z
X-ELQKLNSWDVFTNWL-Z
X-YELQKLNSWDVFTNWL-Z
X-MYELQKLNSWDVFTNWL-Z
X-NMYELQKLNSWDVFTNWL-Z
X-KNMYELQKLNSWDVFTNWL-Z
X-EKNMYELQKLNSWDVFTNWL-Z
X-QEKNMYELQKLNSWDVFTNWL-Z
X-QQEKNMYELQKLNSWDVFTNWL-Z
X-IQQEKNMYELQKLNSWDVFTNWL-Z
X-QIQQEKNMYELQKLNSWDVFTNWL-Z
X-AQIQQEKNMYELQKLNSWDVFTNWL-Z
X-QAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-EQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-LEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-SLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-QSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-SQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-ISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-NISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-ANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-EANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z

The one letter amino acid code is used.
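
The truncation series of Tables III and IV follow mechanically from
SEQ ID:7, so they can be regenerated rather than transcribed. The
following is a minimal sketch, not part of the original disclosure;
the function names and the literal "X" and "Z" placeholders are
assumptions chosen for illustration.

```python
# One-letter sequence of the HIV-2 NIHZ DP178 analog (SEQ ID:7), as given in the text.
SEQ_ID_7 = "LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL"


def carboxy_truncations(seq, min_len=3):
    """Keep the amino terminus and drop residues from the carboxy end (Table III)."""
    return [seq[:n] for n in range(min_len, len(seq) + 1)]


def amino_truncations(seq, min_len=3):
    """Keep the carboxy terminus and drop residues from the amino end (Table IV)."""
    return [seq[-n:] for n in range(min_len, len(seq) + 1)]


def with_termini(peptide, x="X", z="Z"):
    """Write a peptide with its terminal groups, e.g. X-LEA-Z."""
    return f"{x}-{peptide}-{z}"


table_iii = [with_termini(p) for p in carboxy_truncations(SEQ_ID_7)]
table_iv = [with_termini(p) for p in amino_truncations(SEQ_ID_7)]

if __name__ == "__main__":
    print("\n".join(table_iii))
    print("\n".join(table_iv))
```

Each list runs from the tripeptide to the full 36-mer, so the last
entry of both lists is the untruncated analog.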




DP178 and DP107 analogs are recognized or
identified, for example, by utilizing one or more of
the 107x178x4, ALLMOTI5 or PLZIP computer-assisted
search strategies described and demonstrated, below,
in the Examples presented in Sections 9 through 16 and
19 through 25. The search strategy identifies
additional peptide regions which are predicted to have
structural and/or amino acid sequence features similar
to those of DP107 and/or DP178.

The search strategies are described fully, below,
in the Example presented in Section 9. While this
search strategy is based, in part, on a primary amino
acid motif deduced from DP107 and DP178, it is not
based solely on searching for primary amino acid
sequence homologies, as such protein sequence
homologies exist within, but not between, major groups
of viruses. For example, primary amino acid sequence
homology is high within the TM protein of different
strains of HIV-1 or within the TM protein of different
isolates of simian immunodeficiency virus (SIV).
Primary amino acid sequence homology between HIV-1 and
SIV, however, is low enough so as not to be useful.
It is not possible, therefore, to find peptide regions
similar to DP107 or DP178 within other viruses, or
within non-viral organisms, whether structurally, or
otherwise, based on primary sequence homology, alone.

Further, while it would be potentially useful to
identify primary sequence arrangements of amino acids
based on, for example, the physical chemical
characteristics of different classes of amino acids
rather than based on the specific amino acids
themselves, such search strategies have, until now,
proven inadequate. For example, a computer algorithm
designed by Lupas et al. to identify coiled-coil
propensities of regions within proteins (Lupas, A., et
al., 1991 Science 252:1162-1164) is inadequate for
identifying protein regions analogous to DP107 or
DP178.

Specifically, analysis of HIV-1 gp160 (containing
both gp120 and gp41) using the Lupas algorithm does
not identify the coiled-coil region within DP107. It
does, however, identify a region within DP178
beginning eight amino acids N-terminal to the start of
DP178 and ending eight amino acids from the
C-terminus. The DP107 peptide has been shown
experimentally to form a stable coiled coil. A search
based on the Lupas search algorithm, therefore, would
not have identified the DP107 coiled-coil region.
Conversely, the Lupas algorithm identified the DP178
region as a potential coiled-coil motif. However, the
peptide derived from the DP178 region failed to form a
coiled coil in solution.

A possible explanation for the inability of the
Lupas search algorithm to accurately identify
coiled-coil sequences within the HIV-1 TM is that the
Lupas algorithm is based on the structure of coiled
coils from proteins that are not structurally or
functionally similar to the TM proteins of viruses,
antiviral peptides (e.g., DP107 and DP178) of which are
an object of this invention.

The computer search strategy of the invention, as
demonstrated in the Examples presented below, in
Sections 9 through 16 and 19 through 25, successfully
identifies regions of proteins similar to DP107 or
DP178. This search strategy was designed to be used
with a commercially-available sequence database
package, preferably PC/Gene.

A series of search motifs, the 107x178x4,
ALLMOTI5 and PLZIP motifs, were designed and
engineered to range in stringency from strict to
broad, as discussed in this Section and in Section 9,
with 107x178x4 being preferred. The sequences
identified via such search motifs, such as those
listed in Tables V-XIV, below, potentially exhibit
antifusogenic, such as antiviral, activity, may
additionally be useful in the identification of
antifusogenic, such as antiviral, compounds, and are
intended to be within the scope of the invention.

Coiled-coil sequences are thought to consist of
heptad amino acid repeats. For ease of description,
the amino acid positions within the heptad repeats are
sometimes referred to as A through G, with the first
position being A, the second B, etc. The motifs used
to identify DP107-like and DP178-like sequences herein
are designed to specifically search for and identify
such heptad repeats. In the descriptions of each of
the motifs described, below, amino acids enclosed by
brackets, i.e., [], designate the only amino acid
residues that are acceptable at the given position,
while amino acids enclosed by braces, i.e., {},
designate the only amino acids which are unacceptable
at the given heptad position. When a set of bracketed
or braced amino acids is followed by a number in
parentheses, i.e., (), it refers to the number of
subsequent amino acid positions for which the
designated set of amino acids holds, e.g., a (2) means
"for the next two heptad amino acid positions".

The ALLMOTI5 is written as follows:

{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-

Translating this motif, it would read: "at the
first (A) position of the heptad, any amino acid
residue except C, D, G, H, or P is acceptable, at the
next two (B, C) amino acid positions, any amino acid
residue except C, F, or P is acceptable, at the fourth
heptad position (D), any amino acid residue except C,
D, G, H, or P is acceptable, at the next three (E, F,
G) amino acid positions, any amino acid residue except
C, F, or P is acceptable". This motif is designed to
search for five consecutive heptad repeats (thus the
repeat of the first line five times), meaning that it
searches for 35-mer sized peptides. It may also be
designed to search for 28-mers, by only repeating the
initial motif four times. With respect to the
ALLMOTI5 motif, a 35-mer search is preferred. Those
viral (non-bacteriophage) sequences identified via
such an ALLMOTI5 motif are listed in Table V in U.S.
Patent Application No. 08/470,896 filed on June 6,
1995, which is incorporated herein by reference in its
entirety. These viral sequences potentially exhibit
antiviral activity, may be useful in the
identification of antiviral compounds, and are
intended to be within the scope of the invention. In
those instances wherein a single gene exhibits greater
than one sequence recognized by the ALLMOTI5 search
motif, the amino acid residue numbers of these
sequences are listed under "Area 2", "Area 3", etc.
This convention is used for each of the Tables listed,
below, at the end of this Section.
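
The bracket, brace and parenthesis notation maps directly onto
regular-expression character classes, which gives one way to try the
ALLMOTI5 motif on a sequence. The following is a minimal sketch under
stated assumptions: the function names are hypothetical, sequences are
taken to be upper-case one-letter codes, and the test peptides are
synthetic. It does not reproduce the PC/Gene search referred to in the
text.

```python
import re

# One heptad of ALLMOTI5 as written in the text (whitespace is ignored).
ALLMOTI5_HEPTAD = "{CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3) -"


def motif_to_regex(motif):
    """Translate the bracket/brace motif notation into a regular expression."""
    parts = []
    # Remove whitespace, then split on the hyphens that separate positions.
    for token in "".join(motif.split()).strip("-").split("-"):
        m = re.fullmatch(r"(?:\[([A-Z]+)\]|\{([A-Z]+)\})(?:\((\d+)\))?", token)
        if m is None:
            raise ValueError(f"unrecognized motif element: {token!r}")
        allowed, excluded, count = m.groups()
        # [..] lists the only acceptable residues; {..} lists the unacceptable ones.
        char_class = f"[{allowed}]" if allowed else f"[^{excluded}]"
        parts.append(char_class + (f"{{{count}}}" if count else ""))
    return "".join(parts)


def find_hits(sequence, heptad_motif, heptads=5):
    """Return 0-based start offsets of every window of 7*heptads residues that fits."""
    window = f"(?:{motif_to_regex(heptad_motif)}){{{heptads}}}"
    # A lookahead keeps matches zero-width, so overlapping windows are all reported.
    return [m.start() for m in re.finditer(f"(?={window})", sequence)]


if __name__ == "__main__":
    print(motif_to_regex(ALLMOTI5_HEPTAD))
    print(find_hits("LEALEGK" * 5, ALLMOTI5_HEPTAD))             # 35-mer search
    print(find_hits("LEALEGK" * 5, ALLMOTI5_HEPTAD, heptads=4))  # 28-mer search
```

Setting `heptads=4` corresponds to the 28-mer variant described above;
the default of five heptads corresponds to the preferred 35-mer search.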
The 107x178x4 motif is written as follows:

[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)-
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)-
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)-
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)-

Translating this motif, it would read: "at the
first (A) position of the heptad, only amino acid
residue E, F, I, K, L, N, Q, S, T, V, W, or Y is
acceptable, at the next two (B, C) amino acid
positions, any amino acid residue except C, F, M or P
is acceptable, at the fourth position (D), only amino
acid residue E, F, I, K, L, N, Q, S, T, V, W, or Y is
acceptable, at the next three (E, F, G) amino acid
positions, any amino acid residue except C, F, M or P
is acceptable". This motif is designed to search for
four consecutive heptad repeats (thus the repeat of
the first line four times), meaning that it searches
for 28-mer sized peptides. It may also be designed to
search for 35-mers, by repeating the initial motif
five times. With respect to the 107x178x4 motif, a
28-mer search is preferred.
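
The difference in stringency between the two motifs can be seen by
writing one heptad of each as a regular expression and testing the
same peptides against both. This is an illustrative sketch only: the
regular expressions are hand translations of the motifs as written
above, the helper name `fits` is hypothetical, and the two 28-mers are
synthetic sequences made up for the comparison.

```python
import re

# One heptad of each motif, hand-translated from the written notation.
HEPTAD_107X178X4 = "[EFIKLNQSTVWY][^CFMP]{2}[EFIKLNQSTVWY][^CFMP]{3}"
HEPTAD_ALLMOTI5 = "[^CDGHP][^CFP]{2}[^CDGHP][^CFP]{3}"


def fits(heptad_regex, peptide, heptads):
    """True if the whole peptide is exactly `heptads` consecutive fitting heptads."""
    return re.fullmatch(f"(?:{heptad_regex}){{{heptads}}}", peptide) is not None


# Synthetic 28-mers (four heptads each), used only to exercise the patterns.
strict_ok = "LEALEGK" * 4   # L at positions A and D
broad_only = "AEALEGK" * 4  # A at position A: not excluded by ALLMOTI5,
                            # but not in the 107x178x4 allowed set

if __name__ == "__main__":
    for name, peptide in (("strict_ok", strict_ok), ("broad_only", broad_only)):
        print(name,
              fits(HEPTAD_107X178X4, peptide, 4),
              fits(HEPTAD_ALLMOTI5, peptide, 4))
```

Because 107x178x4 names the residues permitted at positions A and D
while ALLMOTI5 only names those forbidden, a peptide can satisfy
ALLMOTI5 and still be rejected by 107x178x4, as the second peptide is.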

Those viral (non-bacteriophage) sequences
identified via such a 107x178x4 motif are listed in
Table VI in U.S. Patent Application No. 08/470,896
filed on June 6, 1995, which is incorporated herein,
by reference, in its entirety. Those viral
(non-bacteriophage) sequences listed in Table VII of
U.S. Patent Application No. 08/470,896 (incorporated
herein by reference in its entirety) are particularly
preferred.

The 107x178x4 search motif was also utilized to
identify non-viral procaryotic protein sequences, as
listed in Table VIII in U.S. Patent Application No.
08/470,896 filed on June 6, 1995, which is
incorporated herein, by reference, in its entirety.
Further, this search motif was used to reveal a number
of human proteins. The results of this human protein
107x178x4 search are listed in Table IX in U.S. Patent
Application No. 08/470,896 filed on June 6, 1995,
which is incorporated herein, by reference, in its
entirety. The sequences listed in Tables VIII and IX,
therefore, reveal peptides which may be useful as
antifusogenic compounds or in the identification of
antifusogenic compounds, and are intended to be within
the scope of the invention.

The PLZIP series of motifs are as listed in FIG.
19. These motifs are designed to identify leucine
zipper coiled-coil like heptads wherein at least one
proline residue is present at some predefined distance
N-terminal to the repeat. These PLZIP motifs find
regions of proteins with similarities to HIV-1 DP178
generally located just N-terminal to the transmembrane
anchor. These motifs may be translated according to
the same convention described above. Each line
depicted in FIG. 19 represents a single, complete
search motif. "X" in these motifs refers to any amino
acid residue. In instances wherein a motif contains
two numbers within parentheses, this refers to a
variable number of amino acid residues. For example,
X (1,12) is translated to "the next one to twelve
amino acid residues, inclusive, may be any amino
acid".
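
The variable-length element can be expressed as a bounded quantifier
in a regular expression. The sketch below is an assumption-laden
illustration: `gap_to_regex` is a hypothetical helper, the
single-number form X (n) is supported only by analogy with the (2)
convention described earlier, and the proline-gap-leucine pattern is
made up for the demonstration and is not one of the PLZIP motifs of
FIG. 19.

```python
import re


def gap_to_regex(element):
    """Translate 'X (n)' or 'X (min,max)' into a regex for that many arbitrary residues."""
    m = re.fullmatch(r"X\s*\(\s*(\d+)\s*(?:,\s*(\d+)\s*)?\)", element.strip())
    if m is None:
        raise ValueError(f"unrecognized gap element: {element!r}")
    low, high = m.group(1), m.group(2)
    # One number means a fixed run; two numbers mean an inclusive range.
    quantifier = f"{{{low}}}" if high is None else f"{{{low},{high}}}"
    return "[A-Z]" + quantifier


# Hypothetical pattern for illustration only: a proline, then one to
# twelve residues of any kind, then a leucine.
EXAMPLE_PATTERN = "P" + gap_to_regex("X (1,12)") + "L"

if __name__ == "__main__":
    print(EXAMPLE_PATTERN)
    print(re.fullmatch(EXAMPLE_PATTERN, "PAL") is not None)
    print(re.fullmatch(EXAMPLE_PATTERN, "PL") is not None)
```

The range is inclusive at both ends, matching the wording "the next
one to twelve amino acid residues, inclusive".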

Tables X through XIV in U.S. Patent Application
No. 08/470,896 filed on June 6, 1995 (which is
incorporated herein, by reference, in its entirety)
list sequences identified via searches conducted with
such PLZIP motifs. Specifically, Table X lists viral
sequences identified via PCTLZIP, P1CTLZIP and
P2CTLZIP search motifs, Table XI lists viral sequences
identified via P3CTLZIP, P4CTLZIP, P5CTLZIP and
P6CTLZIP search motifs, Table XII lists viral sequences
identified via P7CTLZIP, P8CTLZIP and P9CTLZIP search
motifs, Table XIII lists viral sequences identified
via P12LZIPC searches and Table XIV lists viral
sequences identified via P23TLZIPC search motifs. The
viral sequences listed in these tables represent
peptides which potentially exhibit antiviral activity,
may be useful in the identification of antiviral
compounds, and are intended to be within the scope of
the invention.

The Examples presented in Sections 17, 18, 26 and
27, below, demonstrate that viral sequences identified
via the motif searches described herein exhibit
substantial antiviral characteristics. Specifically,
the Example presented in Section 17 describes peptides
with anti-respiratory syncytial virus activity, the
Example presented in Section 18 describes peptides
with anti-parainfluenza virus activity, the Example
presented in Section 26 describes peptides with
anti-measles virus activity and the Example presented
in Section 27 describes peptides with anti-simian
immunodeficiency virus activity.

The DP107 and DP178 analogs may, further, contain
any of the additional groups described for DP178,
above, in Section 5.1. For example, these peptides
may include any of the additional amino-terminal
groups as described above for "X" groups, and may also
include any of the carboxy-terminal groups as
described, above, for "Z" groups.

Additionally, truncations of the identified DP107
and DP178 peptides are among the peptides of the
invention. Further, such DP107 and DP178 analogs and
DP107/DP178 analog truncations may exhibit one or more
amino acid substitutions, insertions, and/or deletions.
The DP178 analog amino acid substitutions, insertions
and deletions are as described, above, for DP178-like
peptides in Section 5.1. The DP107 analog amino acid
substitutions, insertions and deletions are also as
described, above, for DP107-like peptides in Section
5.2. Representative examples of such DP107/DP178
truncations are provided in Tables XV through XXII of
U.S. Patent Application Serial No. 08/470,896 filed on
June 6, 1995, which is incorporated herein by
reference in its entirety.

Other exemplary DP178 and DP107 peptides and 
DP178-like and DP107-like peptides which are 
considered part of the present invention include the 
peptides described in U.S. Patent Application Serial 
No. 09/315,304 filed on May 4, 1999 which is 
incorporated by reference in its entirety. Such DP178 
and DP107 peptides and DP178-like and DP107-like 
peptides include, e.g., the peptides listed below in 
Table V. 

Other DP178, DP107, DP178-like and DP107-like
peptides include peptides described, e.g., in U.S.
Patent Application Serial No. 08/038,387 filed on
March 29, 1993, now U.S. Patent No. 5,627,023; in
U.S. Patent Application Serial No. 08/073,028 filed on
June 7, 1993, now U.S. Patent No. 5,464,933; in U.S.
Patent Application Serial No. 08/255,208 filed on
June 7, 1994; in U.S. Patent Application Serial No.
08/360,107 filed on December 20, 1994; and in U.S.
Patent Application Serial No. 08/470,896 filed on June
6, 1995, each of which is incorporated herein by
reference in its entirety.

TABLE V

T No.  Sequence

1  GIKQLQARILAVERYLKDQ
2  NNLLRAIEAQQHLLQLTVW
3  NEQELLELDKWASLWNWF
4  YTSLIHSLIEESQNQQEK
5  Ac-VWGIKQLQARILAVERYLKDQQLLGIWG-NH2
6  QHLLQLTVWGIKQLQARILAVERYLKDQ
7  LRAIEAQQHLLQLTVWGIKQLQARILAV
8  VQQQNNLLARIEAQQHLLQLTVWGIKQL
9  RQLLSGIVQQQNNLLRAIEAQQHLLQLT
10  MTLTVQARQLLSGIVQQQNNLLRAIEAQ
12  VVSLSNGVSVLTSKVLDLKNYIDKQLL
13  LLSTNKAVVSLSNGVSVLTSKVLDLKNY
15  Ac-VLHLEGEVNKIKSALLSTNKAVVSLSNG-NH2
19  Ac-LLSTNKAVVSLSNGVSVLTSKVLDLKNY-NH2
20  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
21  Ac-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
22  Ac-IELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2
23  Ac-IELSNIKENKCNGTDAKVKLIKQELDKY-NH2
24  Ac-ENKCNGTDAKVKLIKQELDKYKNAVTEL-NH2
25  Ac-DAKVKLIKQELDKYKNAVTELQLLMQST-NH2
26  Ac-CNGTDAKVKLIKQELDKYKNAVTELQLL-NH2
27  Ac-SNIKENKCTGTDAKVKLIKQELDKYKNAVTELQLL-NH2
28  Ac-ASGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNGV-NH2
29  Ac-SGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNG-NH2
30  Ac-VLHLEGEVNKIKSALLSTHKAVVSLSNGVSVLTSK-NH2
31  Ac-ARKLQRMKQLEDKVEELLSKNYHYLENEVARLKKLV-NH2
32  Ac-RMKQLEDKVEELLSKNYHYLENEVARLKKLVGER-NH2
33  Ac-VQQQNNLLRAIEAQQHLLQLTVWGIKQL-NH2
34  Ac-LRAIEAQQHLLQLTVWGIKQLQARILAV-NH2
35  Ac-QHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
36  Ac-RQLLSGIVQQQNNLLRAIEAQQHLLQLT-NH2
37  Ac-MTLTVQARQLLSGIVQQQNNLLRAIEAQ-NH2
38  Ac-AKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2
39  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2
40  Ac-AKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVA-NH2
41  Ac-GTIALGVATSAQITAAVALVEAKQARSD-NH2
42  Ac-ATSAQITAAVALVEAKQARSDIEKLKEA-NH2
43  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKA-NH2
44  Ac-IEKLKEAIRDTNKAVQSVQSSIGNLIVA-NH2
45  Ac-IRDTNKAVQSVQSSIGNLIVAIKSVQDY-NH2
46  Ac-AVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2
47  Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLARILAVERYLKDQ-NH2
48  Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQ-NH2
49  Ac-MTWMEMDREINNYTSLIGSLIEESQNQQEKNEQELLELDKWASLWWWF-NH2
50  Ac-WMEWDREINNYTSLIGSLIEESQNQQEKNEQELLE-NH2
51  Ac-INNYTSLIGSLIEESQNQQEKNEQELLE-NH2
52  Ac-INNYTSLIGSLIEESQNQQEKNEQELLELDKWASL-NH2
53  Ac-EWDREINNYTSLIGSLIEESQNQQEKNEQEGGC-NH2
54  Ac-QSRTLLAGIVQQQQQLLDWKRQQELLR-NH2
55  Ac-NNDrWQEWERKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD-NH2
56  Ac-WQEWERKVDFLEENITALLEEAQIQQEK-NH2
57  Ac-VDFLEENITALLEEAQIQQEKNMYELQK-NH2
58  Ac-ITALLEEAQIQQEKNMYELQKLNSWDVF-NH2
59  Ac-SSESFTLLEQWNNWKLQIAEQWLEQINEKHYLEDIS-NH2
60  Ac-DKWASLWNWF-NH2
61  Ac-NEQELLELDKWASLWNWF-NH2
62  Ac-EKNEQELLELDKWASLWNWF-NH2
63  Ac-NQQEKNEQELLELDKWASLWNWF-NH2
64  Ac-ESQNQQEKNEQELLELDKWASLWNWF-NH2
65  Ac-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
66  Ac-NDQKKLMSNNVQIVRQQSYSIMSIIKEE-NH2
67  Ac-DEFDASISQVNEKINQSLAFIRKSDELL-NH2
68  Ac-VSKGYSALRTGWYTSVITIELSNIKEN-NH2
69  Ac-VVSLSNGVSVLTSKVLDLKNYIDKQLL-NH2
70  Ac-VNKIKSALLSTNKAVVSLSNGVSVLTSK-NH2
71  Ac-PIINFYDPLVFPSDEFDASISQVNEKINQSLAFIR-NH2
72  Ac-NLVYAQLQFTYDTLRGYINRALAQIAEA-NH2
73  Ac-LNQVDLTETLERYQQRLNTYALVSKDASYRS-NH2
74  Ac-ELLVLKKAQLNRHSYLKDSDFLDAALD-NH2
75  Ac-LAEAGEESVTEDTEREDTEEEREDEEE-NH2
76  Ac-ALLAEAGEESVTEDTEREDTEEEREDEEEENEART-NH2
77  Ac-ETERSVDLVAALLAEAGEESVTEDTEREDTEEERE-NH2
78  Ac-EESVTEDTEREDTEEEREDEEEENEART-NH2
79  Ac-VDLVAALLAEAGEESVTEDTEREDTEEE-NH2
80  Ac-NSETERSVDLVAALLAEAGEESVTE-NH2
81  Ac-DISYAQLQFTYDVLKDYINDALRNIMDA-NH2
82  Ac-SNVFSKDEIMREYNSQKQHIRTLSAKVNDN-NH2
83  Biotin-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
84  Dig-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
85  Biotin-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
86  Dig-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
87  Ac-VLHQLNIQLKQYLETQERLLAGNRIAARQLLQIWKDVA-NH2
88  Ac-LWHEQLLNTAQRAGLQLQLINQALAVREKVLIRYDI0K-NH2
89  Ac-LLDNFESTWEQSKELWEQQEISIQNLHKSALQEYW-NH2
90  Ac-LSNLLQISNNSDEWLEALEIEHEKWKLTQWQSYEQF-NH2
91  Ac-KLEALEGKLEALEGKLEALEGKLEALEGKLEALEGK-NH2
92  Ac-ELRALRGELRALRGELRALRGELRALRGK-NH2
93  Ac-ELKAKELEGEGLAEGEEALKGLLEKAAKLEGLELLK-NH2
94  Ac-WEAAAREAAAREAAAREAAARA-NH2
95  Ac-YTSLIHSLIEESQNQQERNEQELLELDKWASLWNAF-NH2
96  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLANWF-NH2
97  Ac-YTSLIHSLIEESQNQQEKNQQELLELDKWASLWNWF-NH2
98  Ac-YTSLIHSLIEESQNQQEKNEQELLQLDKWASLWNWF-NH2
99  Ac-YTSLIHSLIEESQNQQEKNQQELLQLDKWASLWNWF-NH2
100  Ac-RMKQLEDKVEELLSKNYHLENEVARLKKLVGER-NH2
101  Ac-QQLLQLTVWGIKQLQARILAVERYLKNQ-NH2
102  Ac-NEQELLELDKWASLWNWF-NH2
103  Ac-YTSLIQSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
104  Ac-IINFYDPLVFPSDEFDASISQVNEKINQSLAFIRK-NH2
105  Ac-INFYDPLVFPSDEFDASISQVNEKINQSLAFIRKS-NH2
106  Ac-NFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSD-NH2
107  Ac-FYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDE-NH2
108  Ac-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2
109  Ac-DPLVFPSDEFDASISQVNEKINQSLAFIRKSDELL-NH2
110  Ac-PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH-NH2
111  Ac-LVFPSDEFDASISQVNEKINQSLAFIRKSDELLHN-NH2
112  Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2
113  Ac-FPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN-NH2
114  Ac-PSDEFDASISQVNEKINQSLAFIRKSDELLHNVNA-NH2
115  Ac-SDEFDASISQVNEKINQSLAFIRKSDELLHNVNAG-NH2
116  Ac-DEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2
117  Ac-EFDASISQVNEKINQSLAFIRKSDELLHNVNAGKS-NH2
118  Ac-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2
119  Ac-DASISQVNEKINQSLAFIRKSDELLHNVNAGKSTT-NH2
120  Ac-ASGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSN-NH2
121  Ac-SGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNG-NH2
122  Ac-GVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNGV-NH2
123  Ac-VAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNGVS-NH2
124  Ac-AVSKVLHLEGEVNKIKSALLSTNKAVVSLSNGVSV-NH2
125  Ac-VSKVLHLEGEVNKIKSALLSTNKAVVSLSNGVSVL-NH2
126  Ac-SKVLHLEGEVNKIKSALLSTNKAVVSLSNGVSVLT-NH2
127  Ac-KVLHLEGEVNKIKSALLSTNKAVVSLSNGVSVLTS-NH2
128  Ac-VLHLEGEVNKIKSALLSTNKAVVSLSNGVSVLTSK-NH2
129  Ac-LHLEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKV-NH2
130  Ac-HLEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVL-NH2
131  Ac-LEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLD-NH2
132  Ac-EGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDL-NH2
133  Ac-GEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLK-NH2
134  Ac-EVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKN-NH2
135  Ac-VNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNY-NH2
136  Ac-NKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYI-NH2
137  Ac-KIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYID-NH2
138  Ac-IKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDK-NH2
139  Ac-KSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQ-NH2
140  Ac-SALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQL-NH2
141  Ac-ALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQLL-NH2
142  Ac-YTSVITIELSNIKENKCNGTDAKVKLIKQELDKYK-NH2
143  Ac-TSVITIELSNIKENKCNGTDAKVKLIKQELDKYKN-NH2
144  Ac-SVITIELSNIKENKCNGTDAKVKLIKQELDKYKNA-NH2
145  Ac-VITIELSNIKENKCNGTDAKVKLIKQELDKYKNAV-NH2
146  Ac-ITIELSNIKENKCNGTDAKVKLIKQELDKYKNAVT-NH2
147  Ac-TIELSNIKENKCNGTDAKVKLIKQELDKYKNAVTE-NH2
148  Ac-IELSNIKENKCNGTDAKVKLIKQELDKYKNAVTEL-NH2
149  Ac-ELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQ-NH2
150  Ac-LSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQL-NH2
151  Ac-SNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLL-NH2
152  Ac-NIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLM-NH2
153  Ac-IKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQ-NH2
154  Ac-KENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQS-NH2






155  Ac-ENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2
156  Ac-LLDNFESTWEQSKELWELQEISIQNLHKSALQEYWN-NH2
157  Ac-ALGVATSAQITAAVALVEAKQARSDIEKLKEAIRD-NH2
158  Ac-LGVATSAQITAAVALVEAKQARSDIEKLKEAIRDT-NH2
159  Ac-GVATSAQITAAVALVEAKQARSDIEKLKEAIRDTN-NH2
160  Ac-VATSAQITAAVALVEAKQARSDIEKLKEAIRDTNK-NH2
161  Ac-ATSAQITAAVALVEAKQARSDIEKLKEAIRDTNKA-NH2
162  Ac-TSAQITAAVALVEAKQARSDIEKLKEAIRDTNKAV-NH2
163  Ac-SAQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQ-NH2
164  Ac-AQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQS-NH2
165  Ac-QITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSV-NH2
166  Ac-ITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQ-NH2
167  Ac-TAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQS-NH2
168  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2
169  Ac-AVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSI-NH2
170  Ac-VALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIG-NH2
171  Ac-ALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGN-NH2
172  Ac-LVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNL-NH2
173  Ac-VEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLI-NH2
174  Ac-EAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIV-NH2
175  Ac-KQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAI-NH2
176  Ac-QARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIK-NH2
177  Ac-ARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKS-NH2
178  Ac-RSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSV-NH2
179  Ac-SDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQ-NH2
180  Ac-DIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQD-NH2
181  Ac-IEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDY-NH2
182  Ac-EKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYV-NH2
183  Ac-KLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVN-NH2
184  Ac-LKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNK-NH2
185  Ac-KEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKE-NH2
186  Ac-EAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEI-NH2
187  Ac-AIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2
188  Ac-IRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2
189  Ac-YTPNDITLNNSVALDPIDISIELNKAKSDLEESKE-NH2
190  Ac-TPNDITLNNSVALDPIDISIELNKAKSDLEESKEW-NH2
191  Ac-PNDITLNNSVALDPIDISIELNKAKSDLEESKEWI-NH2
192  Ac-NDITLNNSVALDPIDISIELNKAKSDLEESKEWIR-NH2
193  Ac-DITLNNSVALDPIDISIELNKAKSDLEESKEWIRR-NH2
194  Ac-ITLNNSVALDPIDISIELNKAKSDLEESKEWIRRS-NH2
195  Ac-TLNNSVALDPIDISIELNKAKSDLEESKEWIRRSN-NH2
196  Ac-LNNSVALDPIDISIELNKAKSDLEESKEWIRRSNQ-NH2
197  Ac-NNSVALDPIDISIELNKAKSDLEESKEWIRRSNQK-NH2
198  Ac-NSVALDPIDISIELNKAKSDLEESKEWIRRSNQKL-NH2
200  Ac-SVALDPIDISIELNKAKSDLEESKEWIRRSNQKLD-NH2
201  Ac-VALDPIDISIELNKAKSDLEESKEWIRRSNQKLDS-NH2
202  Ac-ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2
203  Ac-LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSIG-NH2
204  Ac-DPIDISIELNKAKSDLEESKEWIRRSNQKLDSIGN-NH2
205  Ac-PIDISIELNKAKSDLEESKEWIRRSNQKLDSIGNW-NH2
206  Ac-IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNWH-NH2
207  Ac-DISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQ-NH2
208  Ac-ISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQS-NH2
209  Ac-SIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSS-NH2
210  Ac-IELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSST-NH2
211  Ac-ELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSSTT-NH2
212  Ac-ELRALRGELRALRGELRALRGELRALRGELRALRGK-NH2
213  Ac-YTSLIHSLIEESQNQQQKNEQELLELDKWASLWNWF-NH2
214  Ac-YTSLIHSLIEESQNQQEKNEQELLELNKWASLWNWF-NH2
215  Ac-YTSLIHSLIEQSQNQQEKNEQELLELDKWASLWNWF-NH2
216  Ac-YTSLIHSLIQESQNQQEKNEQELLELDKWASLWNWF-NH2
217  Ac-YTSLIHSLIQQSQNQQQKNQQQLLQLNKWASLWNWF-NH2
218  Ac-EQELLELDKWASLWNWF-NH2
219  Ac-QELLELDKWASLWNWF-NH2
220  Ac-ELLELDKWASLWNWF-NH2
221  Ac-LELDKWASLWNWF-NH2
222  Ac-ELDKWASLWNWF-NH2
226  Ac-WASLWNWF-NH2
227  Ac-ASLWNWF-NH2
229  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLANAA-NH2
230  Ac-YTSLIHSLIEESQNQQEKNEQQLLELDKWASLWNWF-NH2
231  Ac-YTSLIQSLIEESQNQQEKNQQELLELDKWASLWNWF-NH2
234  Ac-EAAAREAAAREAAARLELDKWASLWNWF-NH2
236  Ac-PSLRDPISAEISIQALSYALGGDINKVLEKLGYSG-NH2
237  Ac-SLRDPISAEISIQALSYALGGDINKVLEKLGYSGG-NH2
238  Ac-LRDPISAEISIQALSYALGGDINKVLEKLGYSGGD-NH2
239  Ac-RDPISAEISIQALSYALGGDINKVLEKLGYSGGDL-NH2
240  Ac-DPISAEISIQALSYALGGDINKVLEKLGYSGGDLL-NH2
241  Ac-PISAEISIQALSYALGGDINKVLEKLGYSGGDLLG-NH2
242  Ac-ISAEISIQALSYALGGDINKVLEKLGYSGGDLLGI-NH2
243  Ac-SAEISIQALSYALGGDINKVLEKLGYSGGDLLGIL-NH2
244  Ac-AEISIQALSYALGGDINKVLEKLGYSGGDLLGILE-NH2
245  Ac-EISIQALSYALGGDINKVLEKLGYSGGDLLGILES-NH2
246  Ac-ISIQALSYALGGDINKVLEKLGYSGGDLLGILESR-NH2
247  Ac-SIQALSYALGGDINKVLEKLGYSGGDLLGILESRG-NH2
248  Ac-IQALSYALGGDINKVLEKLGYSGGDLLGILESRGI-NH2
249  Ac-QALSYALGGDINKVLEKLGYSGGDLLGILESRGIK-NH2
250  Ac-ALSYALGGDINKVLEKLGYSGGDLLGILESRGIKA-NH2
251  Ac-LSYALGGDINKVLEKLGYSGGDLLGILESRGIKAR-NH2
252  Ac-PDAVYLHRIDLGPPISLERLDVGTNLGNAIAKLED-NH2
253  Ac-DAVYLHRIDLGPPISLERLDVGTNLGNAIAKLEDA-NH2
254  Ac-AVYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAK-NH2
255  Ac-VYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKE-NH2
256  Ac-YLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKEL-NH2
257  Ac-LHRIDLGPPISLERLDVGTNLGNAIAKLEDAKELL-NH2
258  Ac-HRIDLGPPISLERLDVGTNLGNAIAKLEDAKELLE-NH2
259  Ac-RIDLGPPISLERLDVGTNLGNAIAKLEDAKELLES-NH2
260  Ac-IDLGPPISLERLDVGTNLGNAIAKLEDAKELLESS-NH2
261  Ac-DLGPPISLERLDVGTNLGNAIAKLEDAKELLESSD-NH2
62 Ac - LGPPISLERLDVGTNLGNAIAKLEDAKELLESSDQ-NH2 
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263 Ac - GPPISLERLDVGTNLGNAIAKLEDAKELLESSDQI -NH2 

264 Ac - PP I S LERLD VGTNLGNAI AKLEDAKELLES SDQ IL - NH2 

265 Ac -PIS LERLDVGTNLGNAIAKLED AKE LLES SDQ ILR - NH2 

266 AC - ISLERLDVGTNLGNAIAKLEDAKELLESSDQIRS -NH2 

267 AC - SLERLDVGTNLGNAIAKLEDAKELLES SDQILRSM -NH2 

268 Ac -LERLDVGTNLGNAIAKLEDAKELLESSDQILRSMK-NH2 
5 2 69 Ac - EWIRRSNQKLDS I -NH2 

270 AC - LELDKWASLANAF -NH2 

271 Ac - LELDKWASLFNFF -NH2 

272 Ac - LELDKWASLANWF -NH2 

273 Ac - LELDKWASLWNAF -NH2 

274 Ac -ELGNVNNS ISNALDKLEESNSKLDKVNVKLTSTSA-NH2 

275 Ac -TELGNVNNSISNALDKXEESNSKLDKVNVKLTSTS -NH2 

276 AC-STELGNVNNSISNALDKLEESNSKLDKVNVKLTST-NH2 
10 2 77 AC - 1 STELGNVNNS ISNALDKLEESNSKLDKVNVKLTS-NH2 

278 Ac - D I STELGNVNNS I SNALDKLEESNSKLDKVNVKLT - NH2 

279 Ac-LDISTELGNVNNSISNALDKLEESNSKLDKVNVKL-NH2

280 Ac-NLDISTELGNVNNSISNALDKLEESNSKLDKVNVK-NH2

281 Ac-GNLDISTELGNVNNSISNALDKLEESNSKLDKVNV-NH2

282 Ac-TGNLDISTELGNVNNSISNALDKLEESNSKLDKVN-NH2

283 Ac-VTGNLDISTELGNVNNSISNALDKLEESNSKLDKV-NH2

284 Ac-IVTGNLDISTELGNVNNSISNALDKLEESNSKLDK-NH2

285 Ac-VIVTGNLDISTELGNVNNSISNALDKLEESNSKLD-NH2

286 Ac-QVIVTGNLDISTELGNVNNSISNALDKLEESNSKL-NH2

287 Ac-SQVIVTGNLDISTELGNVNNSISNALDKLEESNSK-NH2

288 Ac-DSQVIVTGNLDISTELGNVNNSISNALDKLEESNS-NH2

289 Ac-LDSQVIVTGNLDISTELGNVNNSISNALDKLEESN-NH2

290 Ac-ILDSQVIVTGNLDISTELGNVNNSISNALDKLEES-NH2

291 Ac-SILDSQVIVTGNLDISTELGNVNNSISNALDKLEE-NH2

292 Ac-ISILDSQVIVTGNLDISTELGNVNNSISNALDKLE-NH2

293 Ac-NISILDSQVIVTGNLDISTELGNVNNSISNALDKL-NH2

294 Ac-KNISILDSQVIVTGNLDISTELGNVNNSISNALDK-NH2

295 Ac-QKNISILDSQVIVTGNLDISTELGNVNNSISNALD-NH2

296 Ac-YQKNISILDSQVIVTGNLDISTELGNVNNSISNAL-NH2

297 Ac-TYQKNISILDSQVIVTGNLDISTELGNVNNSISNA-NH2

298 Ac-ATYQKNISILDSQVIVTGNLDISTELGNVNNSISN-NH2

299 Ac-DATYQKNISILDSQVIVTGNLDISTELGNVNNSIS-NH2

300 Ac-FDATYQKNISILDSQVIVTGNLDISTELGNVNNSI-NH2

301 Ac-EFDATYQKNISILDSQVIVTGNLDISTELGNVNNS-NH2

302 AC-GEFDATYQKNISILDSQVIVTGNLDISTELGNVNN-NH2 

303 AC-SGEFDATYQKNISILDSQVIVTGNLDISTELGNVN-NH2 

304 Ac -LSGEFDATYQKNISILDSQVIVTGNLDISTELGNV-NH2 

305 Ac -RLSGEFDATYQKNISILDS QVIVTGNLD I STELGN-NH2 

306 AC -LRLSGEFDATYQKNISILDSQVIVTGNLDISTELG-NH2 

307 Ac-TLRLSGEFDATYQKNISILDSQVIVTGNLDISTEL-NH2

308 Ac-ITLRLSGEFDATYQKNISILDSQVIVTGNLDISTE-NH2

309 Ac-GITLRLSGEFDATYQKNISILDSQVIVTGNLDIST-NH2

310 Ac -TATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNNT-NH2 

311 AC - ITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNN-NH2 

312 Ac-SITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFN-NH2

314 Ac-KESITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQ-NH2

315 Ac-LKESITATIEAVHEVTDGLSQLAVAVGKMQQFVND-NH2

316 Ac-RLKESITATIEAVHEVTDGLSQLAVAVGKMQQFVN-NH2

317 Ac-LRLKESITATIEAVHEVTDGLSQLAVAVGKMQQFV-NH2

318 Ac-ILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQF-NH2

319 Ac-NILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQ-NH2

320 Ac - ANILRLKESITATIEAVHEVTDGLSQLAVAVGKMQ-NH2 

321 Ac - AANILRLKES ITATIEAVHEVTDGLSQLAVAVGKM-NH2 

322 Ac - HKCDDECMNS VKNGTYDYPKYEEESKLNRNEIKGV-NH2 

323 Ac - KCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGVK-NH2 

324 Ac - CDDECMNS VKNGTYD YPKYEEES KLNRNE IKGVKL -NH2 

325 Ac-DDECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLS-NH2

326 Ac-DECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSS-NH2

327 Ac-ECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSM-NH2

328 Ac-CMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMG-NH2

329 Ac-MNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGV-NH2

330 Ac-NSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVY-NH2

331 Ac-SVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQ-NH2

332 Ac-VKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQI-NH2

333 Ac-KNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQIL-NH2

334 AC -AFIRKSDELLHNV-NH2 

335 Ac-WLAGAALGVATAAQITAGIALHQSMLNSQAIDNL-NH2

336 Ac-VLAGAALGVATAAQITAGIALHQSMLNSQAIDNLR-NH2

337 Ac-LAGAALGVATAAQITAGIALHQSMLNSQAIDNLRA-NH2

338 Ac-AGAALGVATAAQITAGIALHQSMLNSQAIDNLRAS-NH2

339 Ac-GAALGVATAAQITAGIALHQSMLNSQAIDNLRASL-NH2

340 Ac-AALGVATAAQITAGIALHQSMLNSQAIDNLRASLE-NH2

341 AC - ALGVATAAQ ITAG I ALHQSMLNS QAIDNLRAS LET - NH2 

342 AC-LGVATAAQITAGIALHQSMLNSQAIDNLRASLETT-NH2 

343 Ac-GVATAAQITAGIALHQSMLNSQAIDNLRASLETTN-NH2

344 Ac-VATAAQITAGIALHQSMLNSQAIDNLRASLETTNQ-NH2

345 Ac-ATAAQITAGIALHQSMLNSQAIDNLRASLETTNQA-NH2

346 Ac-TAAQITAGIALHQSMLNSQAIDNLRASLETTNQAI-NH2

347 Ac-AAQITAGIALHQSMLNSQAIDNLRASLETTNQAIE-NH2

348 Ac-AQITAGIALHQSMLNSQAIDNLRASLETTNQAIEA-NH2

349 Ac-QITAGIALHQSMLNSQAIDNLRASLETTNQAIEAI-NH2

350 Ac-ITAGIALHQSMLNSQAIDNLRASLETTNQAIEAIR-NH2

351 Ac-TAGIALHQSMLNSQAIDNLRASLETTNQAIEAIRQ-NH2

352 Ac-AGIALHQSMLNSQAIDNLRASLETTNQAIEAIRQA-NH2

353 AC -GIALHQSMLNSQAIDNLRASLETTNQAIEAIRQAG-NH2 

354 Ac - IALHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQ-NH2 

355 Ac -ALHQSMLNS QAIDNLRAS LETTNQAIEAIRQAGQE -NH2 

356 AC - LHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQEM -NH2 

357 AC - HQSMLNS QAIDNLRAS LETTNQAI EAIRQAGQEM I -NH2 

358 Ac-QSMLNSQAIDNLRASLETTNQAIEAIRQAGQEMIL-NH2

359 Ac - SMLNS QAIDNLRAS LETTNQAIEAIRQAGQEMILA-NH2 

360 Ac - MLNS QAIDNLRAS LETTNQAI EAIRQAGQEM I LAV -NH2 

361 Ac - LNS QAIDNLRASLETTNQAIEAIRQAGQEMILAVQ -NH2 

362 Ac -NSQAIDNLRASLETTNQAIEAIRQAGQEMILAVQG-NH2 

363 Ac - SQAIDNLRASLETTNQAIEAIRQAGQEMILAVQGV -NH2 

WO 01/51673 PCT/US00/35727



364 Ac-QAIDNLRASLETTNQAIEAIRQAGQEMILAVQGVQ-NH2

365 Ac-AIDNLRASLETTNQAIEAIRQAGQEMILAVQGVQD-NH2

366 Ac-IDNLRASLETTNQAIEAIRQAGQEMILAVQGVQDY-NH2

367 AC -DNLRASLETTNQAIEAIRQAGQEMILAVQGVQDYI -NH2 

368 AC -NLRASLETTNQAI EAIRQAGQEMILAVQGVQD YIN - NH2 

369 AC -LRASLETTNQAIEAIRQAGQEMILAVQGVQDYINN-NH2 

370 Ac - RASLETTNQAIEAIRQAGQEM ILAVQGVQDYINNE - NH2 

371 Ac -YTSVITIELSNIKENKUNGTDAVKLIKQELDKYK-NH2 

372 Ac -TSVITIELSNIKENKUNGTDAVKLIKQELDKYKN-NH2 

373 Ac-SVITIELSNIKENKUNGTDAVKLIKQELDKYKNA-NH2

374 Ac-SNIKENKUNGTDAKVKLIKQELDKYKNAVTELQLL-NH2

375 AC - KENKUNGTD AKVKL I KQELDKYKNAVTELQLLMQS - NH2 

376 Ac - CLELDKWASLWNWFC -NH2 

377 Ac-CLELDKWASLANWFC-NH2

378 Ac-CLELDKWASLFNFFC-NH2

379 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLFNFF -NH2 

381 Ac-RMKQLEDKVEELLSKNYHLENELELDKWASLWNWF-NH2

382 Ac-KVEELLSKNYHLENELELDKWASLWNWF-NH2

383 Ac-RMKQLEDKVEELLSKLEWIRRSNQKLDSI-NH2

384 Ac-RMKQLEDKVEELLSKLAFIRKSDELLHNV-NH2

385 Ac-ELEALRGELRALRGELELDKWASLWNWF-NH2

386 Ac-LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2

387 Ac-CNEQLSDSFPVEFFQV-NH2

388 Ac-MAEDDPYLGRPEQMFHLDPSL-NH2

389 Ac-EDFSSIADMDFSALLSQISS-NH2

390 Ac-TWQEWERKVDFLEENITALLEEAQIQQEKNMYELQ-NH2

391 Ac-WQEWERKVDFLEENITALLEEAQIQQEKNMYELQK-NH2

392 Ac-QEWERKVDFLEENITALLEEAQIQQEKNMYELQKL-NH2

393 Ac-EWERKVDFLEENITALLEEAQIQQEKNMYELQKLN-NH2

394 Ac-WERKVDFLEENITALLEEAQIQQEKNMYELQKLNS-NH2

395 Ac-ERKVDFLEENITALLEEAQIQQEKNMYELQKLNSW-NH2

396 Ac-RKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD-NH2

397 Ac-KVDFLEENITALLEEAQIQQEKNMYELQKLNSWDV-NH2

398 Ac-VDFLEENITALLEEAQIQQEKNMYELQKLNSWDVF-NH2

399 Ac-DFLEENITALLEEAQIQQEKNMYELQKLNSWDVFG-NH2

400 Ac-FLEENITALLEEAQIQQEKNMYELQKLNSWDVFGN-NH2

401 Ac-LEENITALLEEAQIQQEKNMYELQKLNSWDVFGNW-NH2

402 Ac-LEENITALLEEAQIQQEKNMYELQKLNSWDVFGNWF-NH2

403 Ac-NEQSEEKENELYWAKEQLLDLLFNIFNQTVGAWIMQ-NH2

405 Ac-QQQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKD-NH2

406 Ac-QQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2

407 Ac-QQLLDVVKRQQELLRLTVWGPKNLQTRVTAIEKYLKDQ-NH2

408 Ac-DERKQDKVLWQQTGTLQLTLIQLEKTAKLQWVRLNRY-NH2

409 Ac-QQQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKY-NH2

410 Ac-QQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYL-NH2

411 Ac-QLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLK-NH2

412 Ac-LLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKD-NH2

413 Ac-LDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2

414 Ac-DVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQA-NH2

415 Ac-VVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQ-NH2

416 Ac-VKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQL-NH2

417 Ac-KRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLN-NH2

418 Ac-RQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNA-NH2

419 Ac-QQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAW-NH2

420 Ac-QELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWG-NH2

421 Ac-ELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWGC-NH2

422 Ac-NNLLRAIEAQQHLLQLTWGPKQLQARIIAVERYLKDQ-NH2

423 Ac -SELEI KR YKNRVAS RKCRAKF KQLLQHYRE VAAAK - NH2 

424 Ac - ELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKS -NH2 

425 Ac-LEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSS-NH2

426 Ac-EIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSE-NH2

427 Ac-IKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSEN-NH2

428 Ac-KRYKNRVASRKCRAKFKQLLQHYREVAAAKSSEND-NH2

429 Ac-RYKNRVASRKCRAKFKQLLQHYREVAAAKSSENDR-NH2

430 Ac-YKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRL-NH2

431 Ac-KNRVASRKCRAKFKQLLQHYREVAAAKSSENDRLR-NH2

432 Ac-NRVASRKCRAKFKQLLQHYREVAAAKSSENDRLRL-NH2

433 Ac-RVASRKCRAKFKQLLQHYREVAAAKSSENDRLRLL-NH2

434 Ac-VASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLL-NH2

435 Ac-ASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLK-NH2

436 Ac-SRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQ-NH2

437 Ac-RKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQM-NH2

438 Ac-KCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMC-NH2

439 Ac - CRAKFKQLLQHYRE VAAAKS S ENDRLRLLLKQMCP -NH2 

440 AC - RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPS -NH2 

441 Ac -AKF KQLLQHYRE VAAAKS S END RLRLLLKQMCPSL-NH2 

442 Ac - KFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLD-NH2 

443 Ac-FKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDV-NH2

444 Ac-KQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVD-NH2

445 Ac-QLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS-NH2

446 Ac-LLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSI-NH2

447 Ac-LQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSII-NH2

448 Ac-QHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIP-NH2

449 Ac-HYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPR-NH2

450 Ac - YREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRT-NH2 

451 Ac - RE VAAAKS SENDRLRLLLKQMCPSLDVDS I IPRTP - NH2 

452 Ac - E VAAAKSSENDRLRLLLKQMCPSLDVDS I IPRTPD - NH2 

453 Ac-VAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDV-NH2

454 Ac-AAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVL-NH2

455 Ac-AAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLH-NH2

456 Ac-AKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHE-NH2

457 Ac-KSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHED-NH2

458 Ac-SSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDL-NH2

459 Ac-SENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLL-NH2

460 Ac-ENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLLN-NH2

461 Ac-NDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLLNF-NH2

534 Ac-PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGML-NH2

535 Ac-GYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLP-NH2

536 Ac-YRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPV-NH2

537 Ac-RWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVC-NH2

AC-WMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCP-NH2 

Ac-MCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPL-NH2

Ac - CLRRFI I FLFILLLCL IFLLVLLDYQGMLPVCPL I -NH2 

AC-LRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIP-NH2 

Ac - RRFI I FLF ILLLCLI FLLVLLD YQGMLP VCPL IPG - NH2 

Ac - RF I IFLF ILLLCL IFLLVLLDYQGMLPVCPLI PGS -NH2 

AC-FIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSS-NH2 

Ac-IIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSST-NH2

Ac-IFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTT-NH2

AC-FLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTS-NH2 

AC-LFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTST-NH2 

AC-FILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTG-NH2 

Ac - ILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGP-NH2 

AC-LLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPC-NH2 

AC-LLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCR-NH2 

AC-LCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRT-NH2 

AC-CLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTC-NH2 

AC-LIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCM-NH2 

Ac - IFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMT-NH2 

AC-FLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMTT-NH2 

Ac - PPLVLQAGFFLLTRILTIPQSLDS WWTSLNFLGGT-NH2 

Ac - LLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTT-NH2 

AC-LVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTV-NH2 

AC-VLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVC-NH2 

AC-LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCL-NH2 

AC-QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG-NH2 

AC-AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ-NH2 

Ac-GFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQN-NH2

Ac - FFLLTRILTIPQSLDS WWTSLNFLGGTTVCLGQNS -NH2 

Ac - FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ-NH2 

AC-LLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQS-NH2 

AC-LTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQSP-NH2 

AC-FWNWLSAWKDLELKSLLEEVKDELQKMR-NH2 

Ac -NNLLRAIEAQQHLLQLTVW-NH2 

Ac - CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ -NH2 

Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

C13H27CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

AC-AVSKGYLSALRTGWYTSVITIELSNIKENKUNGTDA-NH2 

Ac - S ISNIETVIEFQQKNNRLLEITREFS WAGVTTPVS -NH2 

AC-DQQIKQYKRLLDRLIIPLYDGLRQKDVIVSNQESN-NH2 

Ac - YSELTNIFGDNIGSLQEKGIKLQGIASLYRTNITEI -NH2 

AC - TS ITLQVRLPLLTRLLNTQI YRVDS IS YNIQNREWY-NH2 

AC - VE I AE YRRLLRTVLE P IRDALNAMTQNIRPVQS VA -NH2 

Ac - S YFIVLS IAYPTLSEIKGVIVHRLEGVS YNIGSQEW-NH2 

Ac -LKEAIRDTNKAVQSVQSS IGNLIVAIKS -NH2 

NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

QKQEP IDKELYPLTSL 

YPKFVKQNTLKLAT 
QYIKANQKFIGITE

587 NGQIGNDPNRDILY

588 Ac-RPDVY-OH

589 CLELDKWASLWNWFC- (cyclic)

590 CLELDKWASLANWFC- (cyclic)

591 CLELDKWASLANFFC- (cyclic) 

594 AC-1WLLRAIEAQQQHLLQLTWGIKQLQARILAVERYLKDQ-NH2 

595 AC-CGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNNWF-NH2 

596 Ac - PLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGT-NH2 

597 Ac - LL VLQAGFFLLTRI LT I PQS LDS WWTS LNFLGGTT -NH2 

598 AC-LVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTV-NH2 

599 Ac - VLQAGFFLLTRI LT I PQSLDSWWTSLNFLGGTTVC-NH2 

600 Ac - LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCL -NH2 

601 Ac - QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG -NH2 

602 AC-AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ-NH2 

603 Ac - GFFLLTRILTI PQSLDSWWTSLNFLGGTTVCLGQN- NH2 

604 Ac - FFLLTRILTI PQS LDS WWTS LNFLGGTTVCLGQNS -NH2 

605 Ac - FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ -NH2 

606 Ac -LLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQS -NH2 

607 Ac - LTRILT I PQS LDS WWTSLNFLGGTTVCLGQNS QSP - NH2 

608 Ac - LELDKWASLWNWA -NH2 

609 AC-LELDKWASAWNWF-NH2 

610 AC - LELDKAASLWNWF -NH2 

611 Ac -LKLDKWASLWNWF-NH2 

612 Ac - LELKKWASLWNWF -NH2 

613 Ac - DELLHNVNAGRS T -NH2 

614 Ac - KSDELLHNVNAGKST -NH2 

615 Ac - IRKSDELLHNVNAGKST-NH2 

616 Ac - AFIRKSDELLHNVNAGKST-NH2 

617 Ac - FDAS I S QVNEKINQS LAF I - NH2 

618 Ac-YAADKESTQKAFDGITNKVNSVIEKMNTQFEAVGKE-NH2

619 Ac-SVIEKMNTQFEAVGKEFGNLERRLENLNKRMEDGFL-NH2

620 Ac-VWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQL-NH2

621 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQEGGC-NH2 

622 Ac - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASL -NH2 

623 AC - INNYTSLIHSLIEESQNQQEKNEQELLE -NH2 

624 AC -WMEWDRE INNYTSLIHSLIEESQNQQEKNEQELLE -NH2 

625 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

626 Ac - IDIS IELNKAKSDLEESKEWIKKSNQKLDS IGNWH -NH2 

627 Ac -NQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFI -NH2 

627 Ac -NQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFI -NH2 

628 Ac - QNQQEKNEQELLELDKWASLWNWFNITNWLWYIKI F -NH2 

629 Ac - SQNQQEKNEQELLELDKWASLWNWFNITNWLWYIKI -NH2 

630 Ac-ESQNQQEKNEQELLELDKWASLWNWFNITNWLWYIK-NH2

631 Ac - EESQNQQEKNEQELLELDKWASLWNWFNITNWLWYI -NH2 

632 Ac - IEESQNQQEKNEQELLELDKWASLWNWFNITNWLWY-NH2 

633 AC -LIEESQNQQEKNEQELLELDKWASLWNWFNITNWLW-NH2 

634 AC - SLIEESQNQQEKNEQELLELDKWASLWNWFNITNWL -NH2 

635 Ac-HSLIEESQNQQEKNEQELLELDKWASLWNWFNITNW-NH2

636 Ac-IHSLIEESQNQQEKNEQELLELDKWASLWNWFNITN-NH2

637 Ac-LIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2



- 63 - 



WO 01/51 673 PCT7US00/35727 



T 

No. Sequence 

638 Ac - SLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI -NH2 



639 Ac -TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2 

640 Ac -NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-NH2 

641 Ac -NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2 

642 Ac - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLW-NH2 

643 Ac - EINNYTSLIHSLIEESQNQQEKNEQELLELDKWASL -NH2 

644 Ac -REINNYTSLIHSLIEESQNQQEKNEQELLELDKWAS -NH2 

645 AC-DREINNYTSLIHSLIEESQNQQEKNEQELLELDKWA-NH2 

646 Ac - WDRE INNYTSL IHSL IEES QNQQEKNEQELLELDKW - NH2 

647 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK-NH2

648 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLELD-NH2

649 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

650 Ac-TWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

651 Ac-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

652 Ac-NMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

653 Ac-NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2

654 Ac-WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQ-NH2

655 Ac-IWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNE-NH2

656 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKN-NH2

657 Ac-EQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEK-NH2

658 Ac-LEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQE-NH2

659 Ac-SLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQ-NH2

660 Ac-KSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ-NH2

661 AC-NKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQN-NH2 

662 AC-SLAFIRKSDELLHNVNAGKST-NH2 

663 Ac - FDASISQVNEKINQSLAFIRK-NH2 

664 AC-YTSLIHSLIEESQQQQEKQEQELLELDKWASLWNWF-NH2 

665 AC-FDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

666 Ac - FDAS ISQVNEKINQSLAFIRKSDELLHNVNA-NH2 

667 Ac-FDASISQVNEKINQSLAFIRKSDELLHNV-NH2

668 Ac-FDASISQVNEKINQSLAFIRKSDELLH-NH2

669 Ac - FDAS I SQVNEKINQSLAF IRKS DEL -NH2 

670 Ac - FDAS I SQVNEKINQSLAF IRKSD -NH2 

671 AC-ASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

672 Ac - ISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

673 AC-QVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

674 Ac -NEKINQSLAFIRKSDELLHNVNAGKST-NH2 

675 Ac-KINQSLAFIRKSDELLHNVNAGKST-NH2

676 Ac-NQSLAFIRKSDELLHNVNAGKST-NH2

677 Ac - FWNWLS AWKDLELYPGSLELDKWASLWNWF -NH2 

678 Ac - CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ -NH2 

679 Ac-CGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

680 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF

681 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ

682 Ac-EKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQYGV-NH2

683 Ac-QEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQYG-NH2

684 Ac-QQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQY-NH2

685 Ac-IQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQ-NH2

686 Ac-QIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYI-NH2

687 Ac-AQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQY-NH2

688 Ac-QAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQ-NH2

689 Ac-EQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYI-NH2

690 AC -LEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRY-NH2 

691 AC -SLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVR-NH2 

692 Ac-QSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWV-NH2

693 AC-SQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSW-NH2 

694 Ac - ISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTS -NH2 

695 Ac -NISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFT-NH2 

696 Ac -ANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDF-NH2 

697 Ac-EANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLD-NH2

699 Ac-YLEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-NH2

700 Ac-YTSLIHSLIEESQNQQEKNEQEL-NH2

701 AC-YTSLIHSLIEESQNLQEKNEQELLELDKWASLWNWF-NH2 

702 Ac - YTS L IHS L I EES QNQQEKLEQELLELDKWASLWNWF - NH2 

703 AC-YTSLIHSLIEESQNQQEKNEQELLEFDKWASLWNWF-NH2 

704 Ac - YTS L IHS L IEESQNQQEKNEQELLELDKPASLWNWF -NH2 

705 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASPWNWF-NH2

706 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNSF-NH2

707 Biotin NH(CH2)4CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

708 Biotin NH(CH2)6CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

709 FMOC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

710 FMOC-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 

711 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2 

712 AC - L IEESQNQQEKNEQELLELDKWASLWNWF -NH2 

713 AC - FWNWLSAWKDLELGGPGSGPGGLELDKWASLWNWF -NH2 

714 AC-LIHSLIEESQNQQEKNEQELLELDKWASL-NH2 

715 Ac - TS L IHS L I EES QNQQEKNEQELLELDKWASLWNWF - NH2 

716 AC-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

718 FMOC-GGGGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

719 Ac-HSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

720 Ac-YTSLIYSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

721 AC-YTSLIHSLIEKSQNQQEKNEQELLELDKWASLWNWF-NH2 

722 AC - YTSLIHSS IEESQNQQEKNEQELLELDKWASLWNWF-NH2 

723 Ac - LEANI SQLLEQAQ IQQEKNM YELQKLNSWDVFTNWL -NH2 

724 Ac-SLEECDSELEIKRYKNRVASRKCRAKFKQLLQHYR-NH2

725 Ac-LEECDSELEIKRYKNRVASRKCRAKFKQLLQHYRE-NH2

726 Ac-EECDSELEIKRYKNRVASRKCRAKFKQLLQHYREV-NH2

727 Ac-ECDSELEIKRYKNRVASRKCRAKFKQLLQHYREVA-NH2

728 Ac-CDSELEIKRYKNRVASRKCRAKFKQLLQHYREVAA-NH2

729 Ac-DSELEIKRYKNRVASRKCRAKFKQLLQHYREVAAA-NH2

730 Desaminotyrosine-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

731 WASLWNW-NH2 

732 Ac-EAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWG-NH2

733 Ac-IEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIW-NH2

734 Ac-AIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGI-NH2

735 Ac-RAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLG-NH2

736 Ac-LRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLL-NH2

737 Ac-LLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQL-NH2

738 Ac-NLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQ-NH2

739 Ac-QNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKD-NH2

740 Ac-QQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLK-NH2

741 Ac-QQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYL-NH2

742 Ac-VQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2

743 Ac-IVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVER-NH2

744 Ac-GIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVE-NH2

745 Ac-SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAV-NH2

758 Ac-RSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTV-NH2

760 Ac-GARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2

764 Ac-GSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQH-NH2

765 Ac-GSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQH-NH2

766 Ac-EGSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQ-NH2

767 Ac-RAKFKQLLQHYREVAAAKSSENDRLRLL-NH2

768 Ac-AKFKQLLQHYREVAAAKSSENDRLRLLL-NH2

769 Ac-KFKQLLQHYREVAAAKSSENDRLRLLLK-NH2

770 Ac-PKQLLQHYREVAAAKSSENDRLRLLLKQ-NH2

771 Ac-RAKFKQELQHYREVAAAKSSENDRLRLLLKQMCPS-NH2

772 DKWASLWNWF-NH2

773 Biotin-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

774 Ac-YDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

775 Ac-YDASISQVNEKINQSLAYIRKSDELLHNVNAGKST-NH2

776 Ac-FDASISQVNEKINQSLAYIRKSDELLHNVNAGKST-NH2

777 Ac-FDASISQVQEKIQQSLAFIRKSDELLHQVQAGKST-NH2

778 Ac-FDASISQVNEKINQALAFIRKADELLHNVNAGKST-NH2

779 Ac-FDASISQVNEKINQALAFIRKSDELLHNVNAGKST-NH2

780 Ac-FDASISQVNEKINQSLAFIRKADELLHNVNAGKST-NH2

781 Ac-YDASISQVQEEIQQALAFIRKADELLEQVQAGKST-NH2

782 Ac-FDASISQVNEKINQSLAFIRKSDELLENVNAGKST-NH2

783 Ac-FDASISQVNEEINQSLAFIRKSDELLHNVNAGKST-NH2

784 Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDELLENV-NH2

785 Ac-VFPSDEFDASISQVNEEINQSLAFIRKSDELLENV-NH2

786 Ac-VYPSDEYDASISQWEEINQALAYIRKADELLENV-NH2

787 Ac-VFPSDEFDASISQVNEEINQSLAFIRKSDELLHNV-NH2

788 Ac-SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQ-NH2

789 Ac-WSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEES-NH2

790 Ac-SWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEE-NH2

791 Ac-ASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIE-NH2

792 Ac-NASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLI-NH2

793 Ac-WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSL-NH2

793 Ac-WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSL-NH2

794 Ac-PWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHS-NH2

795 Ac-VPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIH-NH2

796 Ac-AVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLI-NH2

797 Ac-TAVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSL-NH2

798 Ac-TTAVPWNASWSNKSLEQIWNNMTWMEWDREINNYTS-NH2

800 Ac-AAASDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

801 Ac-VFPAAAFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

802 Ac-VFPSDEAAASISQVNEKINQSLAFIRKSDELLHNV-NH2

803 Ac-VFPSDEFDAAAAQVNEKINQSLAFIRKSDELLHNV-NH2

804 Ac-VFPSDEFDASISAAAEKINQSLAFIRKSDELLHNV-NH2

805 Ac-VFPSDEFDASISQVNAAANQSLAFIRKSDELLHNV-NH2

806 Ac-VFPSDEFDASISQVNEKIAAALAFIRKSDELLHNV-NH2

807 AC-VFPSDEFDASISQVNEKINQSAAAIRKSDELLHNV-NH2 

808 AC - VFPSDEFDAS ISQVNEKINQSLAFAAASDELLHNV-NH2 

809 AC - VFPSDEFDASISQVNEKINQSLAFIRKAAALLHNV-NH2 

810 AC-VFPSDEFDASISQVNEKINQSLAFIRKSDEAAANV-NH2 

811 AC-VFPSDEFDASISQVNEKINQSLAFIRKSDELLAAA-NH2 

812 Ac - VYPSDEFDAS ISQVNEKINQSLAFIRKSDELLHNV-NH2 

813 AC-AAAAIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

814 AC-YTSLIHSLIEESQQQQEKNEQELLELDKWASLWNWF-NH2 

815 Ac - YTS L IHSL IEES QNQQEKQEQELLELDKWASLWNWF - NH2 

816 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKQ-NH2

817 Ac -QIWNNMTWMEWDREINNYTSL IHSL IEES QQQQEKN-NH2 

818 AC-QIWNNMTWMEWDREINNYTSLIHSLIEESQQQQEKQ-NH2 

819 AC-NKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQQ-NH2 

820 Ac -FDAS ISQVNEKINQSLAFIEESDELLHNVNAGKST-NH2 

821 Ac -ACIRKSDELCL -NH2 

823 Ac - YTSLIHSLIEESQNQQEKDEQELLELDKWASLWNWF-NH2 

824 Ac - YTSL IHS L I EE S QDQQEKNEQELLELDKWASLWNWF - NH2 

825 Ac - YTSLIHSLIEESQDQQEKDEQELLELDKWASLWNWF -NH2 

826 Ac - YTSLIHS L IEESQNQQEKNEQELLELDKWASLWDWF -NH2 

841 AC - LEANITQSLEQAQIQQEKNMYELQKLNSWDVFTNWL -NH2 

842 Ac - LEANISASLEQAQIQQEKNMYELQKLNS WD VFTNWL -NH2 

843 Ac -LEANISALLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

844 AC - LEANITALLEQAQIQQEKNMYELQKLNSWDVFTNWL -NH2 

845 Ac -LEANITASLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

845 Ac -LEANITASLEQAQIQQEKNMYELQKLNSWDVFTNWL -NH2 

846 AC - RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMUPS -NH2 

847 Ac - Abu -DDE - Abu -MNS VKNGTYDYPKYEEES KLNRNE IKGVKL -NH2 
856 Ac -WQEWEQKVRYLEANISQSLEQAQIQQEKNMYELQKL-NH2 

860 Ac -DEYDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

861 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2 

862 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-NH2 

863 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-NH2 

864 Ac -YTSL IHSL IEES QNQQEKNEQELLELDKWAS-NH2 

865 AC-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

866 Ac -DRE INNYTS L IHSL IEESQNQQEKNEQELLELDKWASLWNWF -NH2 

867 AC-NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK-NH2 

868 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWAAA-NH2 

869 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWAAAANWF-NH2 

870 Ac- YTSL IHSL IEES QNQQEKNEQELLELDAAASLWNWF-NH2 

871 Ac - YTSL IHSL IEESQNQQEKNEQELLAAAKWASLWNWF -NH2 

872 AC-YTSLIHSLIEESQNQQEKNEQAAAELDKWASLWNWF-NH2 

873 Ac - YTSLIHSLIEESQNQQEKAAAELLELDKWASLWNWF-NH2 

874 AC -YTSL IHS LIEESQNQAAANEQELLELDKWASLWNWF-NH2 

875 AC-YTSLIHSLIEESAAAQEKNEQELLELDKWASLWNWF-NH2 

876 Ac - YTS L IHS L I AAAQNQQEKNEQELLELDKWAS L WNWF -NH2 

877 Ac -YTSLIHAAAEESQNQQEKNEQELLELDKWASLWNWF-NH2 

878 Ac -YTSAAASLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

879 AC -EIWNNMTWMEWDRENEKINQSLAFIRKSDELLHNV-NH2 

880 Ac-YISEVNEEINQSLAFIRKADELLENVDKWASLWNWF-NH2



881 Ac-TSVITIELSNIKENKANGTDAKVKLIKQELDKYKN-NH2

882 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFMG-NH2 

883 AC-NEKINQSLAFIRKSDELLHNV-NH2 

884 Biotin-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2 

885 Biotin-PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH-NH2 

886 Biotin-VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

887 Biotin-DEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

888 Biotin-VYPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

889 Biotin-VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

890 AC-VYPSDEFDASISQVQEEIQQALAFIRKADELLEQV-NH2 

891 AC-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

892 AC-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

893 Ac - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

894 Ac - E INNYTSL IHSLIEES QNQQE KNE QELLELDKWAS LWNWF -NH2 

895 Ac - YTS L IHS L IEE SQNQQEKNEQELLELDKWAS L WNWFN -NH2 

896 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI -NH2 

897 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2 

898 Ac - YTS L IHS L IEESQNQQEKNEQELLELDKWASLWNWFNITN -NH2 

899 AC-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

900 AC-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2 

901 AC-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI-NH2 

905 AC - KCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS I IPRTPD -NH2 

906 AC-RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD-NH2 

907 Ac -VYPSDEYDASISQVNEEINQALAYIAAADELLENV-NH2 

909 Ac -YDASISQVNEEINQALAYIRKADELL-NH2 

910 Ac -M-Nle -WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL -NH2 

911 AC - KNGTYD YPKYEEESKLNRNEIKGVKLS SMGVYQI -NH2 

912 Ac - VTEKIQMASDNINDLIQSGVNTRLLTIQSHVQNYI -NH2 

913 QNQQEKNEQELLELDKWASLWNWF -NH2 

914 Ac - QNQQEKNEQELLELDKWASLWNWF -NH2 

915 LWNWF -NH2 

916 ELLELDKWASLWNWF - NH2 

917 E KNEQELLELDKWAS LWNWF - NH2 

918 SL IE E SQNQQEKNEQELLELDKWAS LWNWF -NH2 

919 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW

920 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN

921 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLW 

922 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASL 

923 TS L IHSL IEES QNQQE KNEQELLELDKWAS LWNWF -NH2 

924 SLIHSLIEES QNQQE KNEQELLELD KWAS LWNWF - NH2 

925 LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

926 IHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

940 Ac-AAVALLPAVLLALLAPSELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAK-NH2

941 Ac-AAVALLPAVLLALLAPCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCP-NH2

942 AC-YTSLIHSLIEESQNQQEKNNNIERDWEMWTMNNWIQ-NH2 

944 VYPSDEYDAS ISQVNEE INQALAYIRKADELLENV -NH2 

945 Ac-LMQLARQLMQLARQMKQLADSLMQLARQVSRLESA-NH2

946 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

947 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

948 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

949 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

950 Biotin-W-Nle-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

951 AC-YLEYDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

952 Ac-IKQFINMWQEVGKAMYA-NH2

953 Ac-IRKSDELL-NH2

954 Decanoyl-IRKSDELL-NH2

955 Acetyl-Aca-Aca-IRKSDELL-NH2

956 AC-YDASISQV-NH2 

957 Ac -NEKINQSL-NH2 

958 Ac - S ISQVNEEINQALAYIRKADELL -NH2 

959 AC - QVNEEINQALAYIRKADELL -NH2 

960 AC - EE INQALAYI RKADELL -NH 

961 AC - NQALAYIRKADELL -NH2 

962 Ac - LAYIRKADELL -NH2 

963 FDASISQVNEKINQALAFIRKSDELL-NH2 

964 Ac-W-Nle-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

965 Ac -ASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS -NH2 

967 AC-WLEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

968 AC-YVKGEPIINFYDPLVFPSDEFDASISQVNEKINQSL-NH2 

969 AC-VYPSDEYDASISQVNEEINQSLAYIRKADELLHNV-NH2 

970 AC - YDAS I SQVNEE INQALAYI RKADELLENV-NH2 

971 Ac - YDAS I SQVNEE INQALAYIRKADELLE -NH2 

972 Ac -VYPSDEYDASISQVNEEINQALAYIRKAAELLHNV-NH2 

973 Ac - VYPSDEYDASISQVNEEINQALAYIRKALELLHNV-NH2 

974 Decanoyl-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

975 Ac-VYPSDEYDASISQVNEEINQLLAYIRKLDELLENV-NH2

976 Ac-DEYDASISQVNEKINQSLAFIRKSDELL-NH2

977 Ac -SNDQGSGYAADKESTQKAFDGITNKVNSVIEKTNT-NH2 

978 AC-ESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2 

979 Ac-DGITNKVNSVIEKTNTQFEAVGKEFGNLEKRLENLNK-NH2

980 Ac -DSNVKNL YDKVRS QLRDNVKELGNGAFEFYHK-NH2 

981 AC - RDNVKELGNGAFE FYHKADDEALNS VKNGTYD YPKY - NH2 

982 Ac - E FYHKADDEALNS VKNGTYD YPKY -NH2 

983 Ac - AAVALLPAVLLALLAPAADKESTQKAFDGITNKVNS -NH2 

984 Ac - AAVALL P AVLLALLAP AAD SNVKNL YDKVRS QLRDN - NH2 

985 Ac - KESTQKAFDGITNKVNSV-NH2 

986 AC - IEKTNTQFEAVGKEFGNLER-NH2 

987 Ac - RLENLNKRVEDGFLDVWTYNAELLVALENE -NH2 

988 Ac-SNVKNLYDKVRSQLRDN-NH2

989 AC-WMEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2 

990 Ac - WMEWDREINNYTSLIHSLIEESQNQQEKNEQE -NH2 

991 AC-MEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2 

992 AC-MEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2 

993 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2 

994 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2 

995 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2 

996 AC-YTKFIYTLLEESQNQQEKNEQELLELDKWASLWNWF-NH2 

997 Ac - YMKQLADSLMQLARQVSRLESA-NH2 

998 Ac - YLMQLARQMKQLADSLMQLARQVSRLESA-NH2 

999 Ac - YQEWERKVDFLEENITALLEEAQ IQQEKNMYELQKL -NH2 
1000 Ac-WMAWAAAINNYTSLIHSLIEESQNQQEKNEQEEEEE-NH2

1001 Ac - YAS L IAAL IEES QNQQEKNEQELLELAKWAALWAWF - NH2 

1002 [AC-EWDREINNYTSLIHSLIEESQNQQEKNEQEGGC-NH2] dimer 

1003 AC-YDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2 

1004 Biotinyl-IDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2

1005 Ac-YTSLI-OH

1006 Fmoc-HSLIEE-OH

1007 Fmoc - SQNQQEK - OH 

1008 Fmoc -NEQELLEL-OH 

1009 Fmoc-DKWASL-OH 

1010 Fmoc-WNWF-OH 

1011 Ac-AKTLERTWDTLNHLLFISSALYKLNLKSVAQITLSI-NH2

1012 Ac - NITLQAKIKQF INMWQEVGKAMYA - NH2 

1013 Ac - LENERTLDFHDSNVKNLYDKVRLQLRDN-NH2 

1014 Ac -LENERTLDFHDSNVKNLYDKVRLQLRDNVKELGNG-NH2 

1015 AC-TLDFHDSNVKNLYDKVRLQLRDNVKELGNGAFEF-NH2 

1016 Ac - ID IS IELNKAKSDLEES KEWIKKSNQKLDS IGNWH -NH2 

1021 Biotinyl-SISQVNEEINQALAYIRKADELL-NH2

1022 Biotinyl-SISQVNEEINQSLAYIRKSDELL-NH2

1023 Ac - S I SQVNEE INQS LAYIRKSDELL - NH2 

1024 Ac - ID IS IELNKAKSDLEES KEWIEKSNQELDS IGNWE -NH2 

1025 Ac-IDISIELNKAKSDLEESKEWIKKSNQELDSIGNWH-NH2

1026 Ac-IDISIELNKAKSDLEEAKEWIDDANQKLDSIGNWH-NH2

1027 Ac-IDISIELNKAKSDLEESKEWIKKANQKLDSIGNWH-NH2

1028 Ac-IDISIELNKAKSDLEEAKEWIKKSNQKLDSIGNWH-NH2

1029 Biotinyl-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL-NH2

1030 Biotinyl-ALDPIDISIELNKAKSDLEESKEWIKKSNQKLDSI-NH2

1031 desAminoTyrosine-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL 

1032 desAminoTyrosine-ALDPIDISIELNKAKSDLEESKEWIKKSNQKLDSI 

1033 Ac - YDAS I SQVNEE INQALAF IRKADEL -NH2 

1034 Ac - YDASI SQVNEE INQSLAYIRKADELL -NH2 

1035 Biotinyl-YDASISQVNEEINQALAYIRKADELL-NH2

1036 Biotinyl-YDASISQVNEEINQSLAFIRKSDELL-NH2

1037 Ac-YDASISQVNEEINQSLAFIRKSDELL-NH2

1038 Ac-WLEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

1039 Biotinyl-IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNWH-NH2

1044 Ac-YESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2

1045 Biotin-DEYDASISQVNEKINQSLAFIRKSDELL-NH2

1046 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

1047 Ac-WQEWEQKVRYLEANISQSLEQAQIQQEKNMYEL-NH2

1048 Ac - WQEWEQKVRYLEANISQSLEQAQIQQEKNEYEL-NH2 

1049 AC-WQEWEQKVRYLEANITALLEQAQIQQEKNEYEL-NH2 

1050 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKNMYEL-NH2 

1051 Ac - WQEWEQKVRYLEANISQSLEQAQIQQEKNEYELQKL -NH2 

1052 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1053 AC - WQEWEQKVRYLEANITALLEQAQIQQEKNMYELQKL -NH2 

1054 AC-IDISIELNKAKSDLEESKEWIEKSNQKLDSIGNWH-NH2 

1055 AC - EFGNLEKRLENLNKRVEDGFLDVWTYNAELLVALENE -NH2 

1056 Ac - EDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQL -NH2 

1057 Ac - S ISQVNEKINQSLAFIRKSDELL -NH2 
1058 desaminoTyr-SISQVNEKINQSLAFIRKSDELL-NH2

1059 Ac - S I SQVNEKINQSLAYIRKSDELL -NH2 

1060 Ac - QQLLDVVKRQQEMLRLTVWGTKNLQARVTAIEKYLKDQ -NH2 

1061 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFC 

1062 Ac - FDAS ISQVNEKINQSLAYIRKSDELL-NH2 

1063 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWA 

1064 Indole - 3 - acetyl - DEFDAS ISQVNEKINQSLAFIRKSDELL-NH2 

1065 Indole - 3 - acetyl -DEFDESISQVNEKINQSLAFIRKSDELL-NH2 

1066 Indole -3 -acetyl -DEFDESISQVNEKIEQSLAFIRKSDELL-NH2 

1067 Indole - 3 - acetyl -DEFDESISQVNEKIEESLAFIRKSDELL -NH2 

1068 Indole -3 -acetyl -DEFDESISQVNEKIEESLQFIRKSDELL-NH2 

1069 Indole -3- acetyl -GGGGGDEFDASISQVNEKINQSLAFIRKSDELL-NH2 

1070 2-Napthoyl-DEFDASISQVNEKINQSLAFIRKSDELL-NH2 

1071 desNH2Tyr-DEFDASISQVNEKINQSLAFIRKSDELL-NH2 

1072 biotin-ALDPIDISIELNKAKSDLEESKEWIRRSNQKL»DSI-NH2 

1073 AC-YDASISQVNEKINQALAYIRKADELLHNVNAGKST-NH2 

1074 Ac - VYPSDEYDAS I SQVNEKINQAIiAYIRKADELLHNV - NH2 

1075 Ac -VYPSDEYDAS IS QVNEKINQSLAYIRKSDELLHNV-NH2 

1076 Ac - WGWGYGYG -NH2 

1077 Ac - YGWGWGWGF -NH2 

1078 Ac - WQEWEQKVRYLEANITALQEQAQIQAEKAE YELQKL -NH2 

1079 Ac - WQEWEQKVRYLEAE ITALQEEAQIQAEKAE YELQKL - NH2 

1081 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWAS 

1082 AC-VWPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

1083 AC - SKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWGV-NH2 

1084 AC-LSKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWG-NH2 

1085 Ac - DLSKNISEQIDQI KKDEQKEGTGWGLGGKWWTS D W - NH2 

1086 Ac - EDLS KNI SEQ IDQ IKKDEQKEGTGWGLGGKWWTSD - NH2 

1087 Ac - IEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWTS -NH2 

1088 Ac - G IEDLS KN I S EQ IDQ I KKDEQKEGTGWGLGGKWWT - NH2 
1089 Ac-IGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWW-NH2

1090 2 -Napthoyl - -PSDEFDASISQVNEKINQSLAFIRKSDELLHNVN-NH2 

1091 Ac - VYPSDEYDAS ISQVNEKINQALAYIRKADELLENV-NH2 

1092 AC-VYPSDEFDASISQVNEKINQALAFIRKADELLENV-NH2 
1093 Ac-VYPSDEYDASISQVNEKINQALAYIREADELLENV-NH2

1094 Bi o t iny 1 - YDAS I SQVNEKINQSLAFIRESDELL -NH2 

1095 AC-AIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKW-NH2 

1096 AC-AAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGK-NH2 

1097 AC-DAAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGG-NH2 

1098 Ac -PDAAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLG-NH2 

1099 Ac - NITDKIDQI IHDFVDKTLPDQGDNDNWWTGWRQWI -NH2 

1100 Ac - KNITDKIDQI IHDFVDKTLPDQGDNDNWWTGWRQW-NH2 

1101 Ac - T KNITDKIDQI IHDFVDKTLPDQGDNDNWWTGWRQ -NH2 

1102 Ac - WTKNITDKIDQ I IHD FVDKTLPDQGDNDNWWTGWR - NH2 

1103 Ac -D WTKNITDKIDQ I IHD FVDKTLPDQGDNDNWWTGW-NH2 

1104 AC - HD WT KNITDKIDQI IHD FVDKTLPDQGDNDNWWTG - NH2 

1105 AC - PHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWT - NH2 

1106 Ac - EPHDWT KNITDKIDQI IHDFVDKTLPDQGDNDNWW -NH2 

1107 AC-IEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNW-NH2 

1108 AC-AIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDN-NH2 
1109 Ac-AAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDND-NH2

1110 AC -DAAIEPHDWTKNITDKIDQI IHDFVDKTLPDQGDN-NH2 

1111 Ac -LS PTVWLS VI WMMWYWGPSLYS ILSPFLPLLPIFF-NH2 

1112 AC-GLSPTVWLSVIWMMWYWGPSLYSILSPFLPLLPIF-NH2 

1113 Ac - VGLS PTVWLS VI WMMWYWGPSLYS ILS PFLPLLP I - NH2 

1114 AC - F VGLS PTWLS VI WMMWYWGPSLYS ILS PFLPLLP -NH2 

1115 Ac - WFVGLSPTVWLS VI WMMWYWGPSLYS ILS PFLPLL -NH2 

1116 Ac -QWFVFLS PTVWLS VI WMMWYWGPSLYS ILS FFLPL-NH2 

1117 Ac - VQWFVGLS PTVWLS VIWMMWYWGPSLYS ILSPFLP -NH2 

1118 Ac - FVQWFVGLSPTVWLSVIWMMWYWGPSLYS ILSPFL-NH2 

1119 Ac -PFVQWFVGLSPTVWLSVI WMMWYWGPSLYS ILS PF-NH2 

1120 Ac -VP FVQWFVGLS PTVWLS VI WMMWYWGPSLYS ILS P-NH2 

112 1 Ac - LVP FVQWFVGLS PTVWLS VIWMMWYWGPSLYS ILS -NH2 

1122 H -NHTTWMEWDRE INNYTSL IHSLIEESQNQQEKNEQELLELDKW - OH 

1123 H - QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ - OH 

1124 AC-VYPSDEFDASISQVNEKINQSLAFIREADELLENV-NH2- 

1125 AC-VFPSDEFDASISQVNEKINQSLAYIREADELLENV-NH2 
112 6 AC - DE FDAS I S QVNEKINQS LAYIREADELL - NH2 

1127 AC-NEQELLELDKWASLWNWFGGGGDEFDASISQVNEKINQSLAFIRKSDELL-NH2 

112 8 AC-LELDKWASLWNWFGGGGDEFDASISQVNEKINQSLAFIRKSDELL-NH2 

112 9 Naphthoyl-EGEGEGEGDEFDASISQVNEKINQSLAFIRKSDELL-NH2 

113 0 AC - AS RKCRAKFKQLLQHYRE VAAAKS SENDRLRLLLKQMCPS LDV - NH2 

1131 Naphthoyl - GDEEDAS ISQVNEKINQSLAF IRKSDELL -NH2 

1132 Naphthoyl -GDEEDASESQVNEKINQSLAFIRKSDELL-NH2 

1133 Naphthoyl -GDEEDASESQQNEKINQSLAFIRKSDELL-NH2 

1134 Naphthoyl-GDEEDASESQQNEKQNQSLAFIRKSDELL-NH2

1135 Naphthoyl -GDEEDASESQQNEKQNQSEAF IRKSDELL -NH2 

1136 Ac -WGDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

1137 AC-YTSLGGDEFDESISQVNEKIEESLAFIRKSDELLGGWNWF-NH2 

1138 Ac-YTSLIHSLGGDEFDESISQVNEKIEESLAFIRKSDELLGGWASLWNWF-NH 

113 9 2 -Naphthoyl -GDEFDESISQVNEKIEESLAFIRKSDELL-NH2 

1140 -2 - Naphthoyl - GDEEDES I SQVNEKIEESLAF IRKSDELL -NH2 

1141 2 -Naphthoyl - GDEEDES ISQVQEKIEESLAFIRKSDELL-NH2 

1142 2 -Napht hoyl -GDEEDES I SQVQEKIEESLLF IRKSDELL -NH2 

1143 B io t in-GDEYDES ISQVNEKIEESLAFIRKSDELL -NH2 

1144 2 -Naphthoyl -GDEYDESISQVNEKIEESLAFIRKSDELL-NH2 

1145 Ac - YTSLIHSLIDEQEKIEELAFIRKSDELLELDKWNWF -NH2 

1146 VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1147 Ac - NNLLRAI EAQQHLLQLTVWGS KQLQARILAVERYLKDQ -NH2 

114 8 GGGVYPSDEYDAS ISQVNEEINQALAYIRKADELLENV-NH2 

1149 Ac -NNLLRAIEAQQHLLQLTVWGEKQLQARILAVERYLKDQ -NH2 

1150 Ac - PTRVNYILI IGVLVLAbuEVTGVRADVHLL - NH2 

1151 Ac - PTRVNYILI IGVLVLAbuEVTGVRADVHLLEQPGNLW -NH2 

1152 Ac-PEKTPLLPTRVNYILIIGVLVLAbuEVTGVRADVHLL-NH2 

1153 AhaGGGVYPSDEYDAS ISQVNEE INQALAYIRKADELLENV -NH2 

1155 Ac -YTSLIHSLGGDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

1156 Ac - YTSLGGDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

1157 AC-DEFDESISQVNEKIEESLAFIRKSDELLGGWASLWNWF-NH2 

1158 Ac - DEFDES ISQVNEKIEESLAFIRKSDELLGGWNWF -NH2 

1159 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF-NH2 
Ac-YTSLIHSLIEESQNQQEKNEQELLELDKSLWNWF-NH2

AC-YTSLIHSLIEESQNQQEKNEQELLELDKLWNWF-NH2 

Ac - YTSLIHSL I EES QNQQEKNEQELLELDKWNWF - NH2 

AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF-NH2 
AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKSLVJNWF-NH2 
AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKLWNWF-NH2 
AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWNWF-NH2 
Ac - MTWMEWDRE INNYTS LIHSLI EESQNQQEKNEQELLELDKWAS LWN - NH2 
Ac - MTWME WDRE INNYTS L IHS LIEESQNQQEKNEQELLELD KWAS L -NH2 
( Pyr ) HWSY ( 2 -nap thyl -D-Ala) LRPG-NH2 
AC -WNWFDEFDESISQVNEKIEESIAFIRKSDELLWNWF-NH2 
Ac - YTSLIHSL IEESQNQQEKNEQELLELDKYASLYNYF -NH2 
AC-YTSLIHSLIEESQNQQEKNEQELLELDKYAYLYNYF-NH2 

2 -Naphthoyl -AcaAcaAcaI)EFDESISQVNEKIEESLAFIRKSDELLAcaAcaAcaW-NH2 

2 -Naphthoyl - AcaAcaAcaGDEFDESISQVNEKIEESLAFIRKSDELLGAcaAcaAcaW-NH2 

2 -Naphthoyl -GDEFDESISQVNEKIEESLAFIRESDELL-NH2 

2-Naphthoyl-GDEFDESISQVNEKIEESLAFIEESDELL-NH2 

Ac - WQEWEQKVNYLEANITALLEQAQIQQEKNE YELQKL -NH2 

Ac - WQEWEQKVDYLEANITALLEQAQIQQEKNEYELQKL-NH2 

AC-WQEWEQKVRWLEANITALLEQAQIQQEKNEYELQKL-NH2 

AC -WQEWEKQVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

AC-WQEWEHQVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

Ac - WQE WEHKVRYLEANITALLEQAQ IQQEKNE YELQKL - NH2 

Ac - WQEWDRE VRYLEANITALLEQAQ IQQE KNE YELQKL - NH2 

AC-WQEWEREVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

Ac -WQEWERQVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

Ac -WQEWEQKVKYLEANITALLEQAQIQQEKNE YELQKL -NH2 

Ac -WQEWEQKVRFLEANITALLEQAQIQQEKNE YELQKL -NH2 

Ac-VNalPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

Ac - VNa 1 P SDENa IDAS I S QVNEE INQ ALA YIRKADELLENV- NH2 

Ac-VNalPSDEYDASISQVNEEINQALANalIRKADELLENV-NH2 

Ac - VYPS DEFDAS I S QVNEKINQ S LAF IREADELLFNFF - NH2 

AC-VYPSDEYDASISQVNEEINQALAYIRKADELLFNFF-NH2 

Ac - YTSLITALLEQAQIQQEKNEYELQKLDKWASLWNWF -NH2 

AC-YTSLITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

AC-YTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

Ac - YTS L ITALLE Q AQ I QQEKNE YELQELDE WAS LWE WF - NH2 

AC-YTSLITALLEEAQIQQEKNEYELQELDEWASLWEWF-NH2 

Naphthoyl - Aua - Aua - Aua - TALLEQAQIQQEKNEYELQKLAua - Aua - Aua - W - NH2 

Ac -WAAWEQKVRYLEANITALLEQAQ I QQEKNE YELQKL -NH2 

AC -WQEAAQKVRYLEANITALLEQAQ I QQEKNE YELQKL -NH2 

Ac -WQEWAAKVRYLEANITALLEQAQ I QQEKNE YELQKL -NH2 

AC -WQAAEQKVRYLEANITALLEQAQIQQEKNE YELQKL -NH2 

Ac - WQEWEAAVRYLEANITALLEQAQ I QQEKNE YELQKL -NH 2 

Ac -WQEWEQAARYLEANITALLEQAQIQQEKNE YELQKL -NH2 

AC - WQEWEQKAAYLEANITALLE QAQ I QQEKNE YELQKL -NH2 

AC -WQE WE QKVAALEAN ITALLE QAQ I QQEKNE YELQKL -NH2 

AC - WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLGGGGWASLWNF -NH2 

2 -Naphthoyl -GDEFDAS ISQVNEKINQSLAFIRKSDELT-NH2 

2 -Naphthoyl - GDEFDAS ISQVNEKINQSLAFTRKSDELT - NH2 
2-Naphthoyl-GDEFDASISQTNEKTNQSLAFTRKSDELT-NH2
2-Naphthoyl-GDEFDASTSQTNEKTNQSLAFTRKSDELT-NH2
2-Naphthoyl-GDEYDASTSQTNEKTNQSLAFTRKSDELT-NH2
2 -Naphthoyl - GDEFDEE I S QVNEKI EES LAFIRKSDELL - NH2 
2 - Naphthoyl - GDEFDAS I SQVNEKINQS LAF IRKSDELA - NH2 
2 -Naphthoyl - GDEFDAS ASQANEKANQSLAFARKSDELA-NH2 
2 -Naphthoyl -GDEFDESISQVNEKIEESLAFTRKSDELL-NH2 
2 -Naphthoyl -GDEFDESISQVNEKTEESLAFIRKSDELI#-NH2 
2 " Naphthoyl - GDEFDES ISQTNEKIEESLAFIRKSDELL -NH2 
2 -Naphthoyl -GDEFDESTSQVNEKIEESLAFIRKSDELL-NH2 
AC-WNWFDEFDESTSQVNEKIEESLAFIRKSDELLWNWF-NH2 
AC-WNWFDEFDESTSQTNEKIEESLAFIRKSDELLWNWF-NH2 
AC-WNWFDEFDESTSQTNEKTEESLAFIRKSDELLWNWF-NH2 
Ac - LQAGFFLLTRILTI PQSLDS WWTSLNFLGGTTVAL -NH2 
AC-YTNLIYTLLEESQNQQEKNEQELLELDKWASLWSWF-NH2 
AC-WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

Ac-NimTWQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 
Ac - WNWFIEESDELLWNWF-NH2 

2 -Naphthoyl -GFIEESDELLW-NH2 
Ac - WF IEESDELLW-NH2 

2 - Naphthoyl -GFNFFIEESDELLFNFF-NH2 
2 - Naphthoyl - GESDELW -NH2 

AC-WNWFGDEFDESISQVQEEIEESLAFIEESDELLGGWNWF-NH2 
AC-WNWFIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
Ac - YTSLITALLEQAQ IQQEENEYELQALDE WASLWEWF -NH2 

AC-YTSLIHSLGGDEFDESISQVNEEIEESLAFIEESDELLGGWASLWNWF-NH2 
2 -Naphthoyl- GDEFDES ISQVQEEIEESLAFIEESDELL-NH2 

H-QARQLLSSIMQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-OH 
Ac - CPKYVKQNTLKLATGMRNVPEKQTR-NH2 

AC-GLFGAIAGFIENGWEGMIDGWYGFRHQNSC-NH2 
Ac - LNFLGGT - NH2 

Ac - LDS WWTS LNFLGGT -NH2 
Ac - ILTIPQSLDSWWTSLNFLGGT-NH2 
AC-GFFLLTRILTIPQSLDSWWTSLNFLGGT-NH2 
Ac - WQEWEQKITALLEQAQIQQEKNE YELQKLDKWASLWNWF -NH2 . 
Ac - WNWFITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 
AC-WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 
AC-WQEWEQKVRYLEANITALLEQAQIQQEKIEYELQKL-NH2 
AC-WQEWEQKVRYLEAQITALLEQAQIQQEKIEYELQKL-NH2 
Ac - KENKANGTDAKVKLIKQELDKYKNAVTELQLLMQS -NH2 
AC -NIKENKANGTDAKVKLIKQELDKYKNAVTELQLLM -NH2 
( FS ) - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
2 -Naphthoyl -GWNWFAcaDEFDESISQVQEElEESLAFIEESDELLAcaWNWF-NH2 
AC-WNWFGDEFDESISQVNEKIEESLAFIEESDELLGWNWF-NH2 
AC-WNWFGDEFDESISQVNEKIEESLAFIRKSDELLGWNWF-NH2 
Ac-WNWF-Aca-DEFDESISQVNEKIEESLAFIRKSDELL-Aca-WNWF-NH2 
Ac - WNWF -Aca -DEFDES ISQVNEKIEESLAFIEESDELL- Ac a - WNWF -NH2 
Ac -EESQNQQEKNEQELLELDKWA-NH2 
EESQNQQEKNEQELLELDKWA 
1262 Ac-CGTTDRSGAPTYSWGA1TOTDVFVLNNTRPPLGNWFG-NH2

12 63 AC-GVEHRLEAACNWTRGERADLEDRDRSELSP-NH2 

12 64 Ac -CVREGNASRAWVAVTPTVATRDGKLPT-NH2 

1265 AC - CFSPRHHWTTQDANAS I YPG-NH2 

1266 AC -LQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS -NH2 

1267 AC-WQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 
12 6 8 Ac -CWQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWFC-NH2 
12 69 AC-WQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWEWF-NH2 

1270 AC-CWQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWEWFC-NH2 

1271 Ac -GQNSQSPTSNHSPTSAPPTAPGYRWA-NH2 
12 72 Ac -PGSSTTSTGPARTALTTAQGTSLYPSA-NH2 

1273 Ac - PGS STTSTGPARTALTTAQGTSLYPSAAATKPSDGNATA-NH2 

1275 AC-WQEWDREITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

1276 AC-WQEWDREITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1277 Ac - WQEWDRE ITALLEQAQIQQEKNE YELQKLDEWEWF -NH2 
12 7 8 Ac -WQEWDREITALLEQAQIQQEKNEYELQKLDEWEWF-NH2 
12 79 Ac - WQEWEREITALLEQAQIQQEKNEYELQKLIEWEWF -NH2 
12 80 AC-WQEWEREITALLEQAQIQQEKIEYELQKLDEWEWF-NH2 
12 81 AC - WQE WE ITALLEQAQ I QQEKNE YELQKLDEWEWF - NH2 
12 82 AC-WQEWEITALLEQAQIQQEKNEYELQKLIEWEWF-NH2 
12 83 AC-WQEWEITALLEQAQIQQEKIEYELQKLDEWEWF-NH2 
12 84 AC-WQEWEITALLEQAQIQQEKIEYELQKLIEWEWF-NH2 

12 85 AC - WQEWDRE IDE YDAS I S QVNEKINQALAYIREADELWEWF - NH2 

1286 Ac - WQEWEREIDEYDAS ISQVNEKINQALAYIREADELWEWF -NH2 

12 87 Ac - WQEWEIDEYDAS ISQVNEKINQALAYIREADELWEWF-NH2 

12 88 Ac - WQEWDRE IDE YDAS I S QVNEEINQALAYIREADELWEWF -NH2 

1289 Ac -WQEWEREIDEYDAS ISQVNEEINQALAYIREADELWEWF -NH2 

12 90 Ac -WQEWEIDEYDASISQVNEEINQALAYIREADELWEWF-NH2 

12 91 AC - WQEWDE YDAS I S QVNEKINQALAYIREADELWEWF -NH2 

12 92 AC - WQEWDE YDAS I SQVNEEINQALAYIREADELWEWF-NH2 

12 93 AC-WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF-NH2 

12 94 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLIEWASLWEWF -NH2 

12 95 AC-WQEWEITALLEQAQIQQEKIEYELQKLIEWASLWEWF-NH2 

12 98 -VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

12 99 AC - WVYPSDEYDAS I SQVNEE INQALAYIRKADELLENVWNWF -NH2 

13 0 0 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
13 01 Ac - WQEWDEYDAS I SQVNEKINQALAYIREADELWAWF -NH2 
13 02 AC - WQAWDEYDAS ISQVNEKINQALAYIREADELWAWF -NH2 
1303 Ac - WQAWDEYDAS I S QVNEKINQALAYIREADELWEWF -NH2 

13 04 Biotin-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2 

1305 Biotin- YDPLVFPSDEFDAS I SQVNEKINQSLAF -NH2 

13 06 B iot ill - QVNEKINQSLAFIRKSDELLHNVNAGKST -NH2 

1307 AC-WMEWDREI-NH2 

13 08 AC-WQEWEQKI-NH2 

1309 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLIKWASLWEWF -NH2 

1310 Ac - WQE WEQKI TALLEQAQ IQQE KIE YELQKL IE WAS LWEWF - NH2 

1311 Ac - WQEWEREISAYTS L ITALLEQAQIQQEKIE YELQKLIEWEWF - NH2 

1312 AC-WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKEWEWF-NH2 

1313 AC -WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKEWEW-NH2 

13 14 Ac - WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKLIEWEW-NH2 






WO 01/51673 



PCT/US00/35727 



T No. Sequence

1315 Ac-FNLSDHSESIQKKFQLMKKHVNKIGVDSDPIGSWLR-NH2
1316 Ac-DHSESIQKKFQLMKKHVNKIGVDSDPIGSWLRGIF-NH2
1317 Ac-WSVKQANLTTSLLGDLLDDVTSIRHAVLQNRA-NH2
1318 Biotin-WMEWDREI-NH2
1319 Biotin-NNMTWMEWDREINNYTSL-NH2
1320 Ac-GAASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLL-NH2
1321 Ac-ASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2
1322 Ac-VSVGNTLYYVNKQEGKSLYVKGEPIINFYDPLVF-NH2
1323 Ac-QHWSYGLRPG-NH2
1324 Ac-WQEWEQKIQHWSYGLRPGWASLWEWF-NH2
1325 Ac-WQEWEQKIQHWSYGLRPGWEWF-NH2
1326 Ac-WNWFQHWSYGLRPGWNWF-NH2
1327 Ac-FNFFQHWSYGLRPGFNFF-NH2
1328 Ac-GAGAQHWSYGLRPGAGAG-NH2
1329 PLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGT
1330 Ac-WQEWEQKITALLEQAQIQQEKIEYELQKLAKWASLWEWF-NH2
1331 Ac-WQEWEQKITALLEQAQIQQEKIEYELQKLAEWASLWEWF-NH2
1332 Ac-WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWEWF-NH2
1333 Ac-WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAWF-NH2
1334 Ac-WQEWEQKITALLEQAQIQQEKAEYELQKLAKWASLWAWF-NH2
1335 Ac-TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2
1336 Ac-KAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQS-NH2
1337 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF-NH2
1338 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF-NH2
1339 Ac-WQEWEQKITALLEQAQIQQEKIEYELQKLDKWEWF-NH2
1340 Ac-YDPLVFPSDEFDASISQVNEKINQSLAF-NH2
1341 Fluor-VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2
1342 Fluor-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2
1344 Ac-SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL-NH2
1345 Ac-QQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
1346 Ac-SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2
1347 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAEWASLWAWF-NH2
1348 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAEWASLWAW-NH2
1349 Ac-WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAW-NH2
1350 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAEWAGLWAWF-NH2
1351 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAEWAGLWAW-NH2
1352 Ac-WQEWEQKITALLEQAQIQQEKAEYELQKLAEWAGLWAW-NH2
1353 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLDKWAGLWEWF-NH2
1354 Ac-WQEWQHWSYGLRPGWEWF-NH2
1355 Ac-WQAWQHWSYGLRPGWAWF-NH2
1356 Biotinyl-WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2
1357 WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF
1358 WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF
1361 Ac-AGSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQ-NH2
1362 Ac-AGSAMGAASLTLSAQSRTLLAGIVQQQQQLLDWKRQQ-NH2
1363 Ac-AGSAMGAASTALTAQSRTLLAGIVQQQQQLLDWKRQQ-NH2
1364 Ac-ALTAQSRTLLAGIVQQQQQLLDWKRQQELLRLTVWGT-NH2
1365 Ac-TLSAQSRTLLAGIVQQQQQLLDWKRQQEMLRLTVWGT-NH2
1366 Ac-TLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGI-NH2
1367 Ac-WQAWIEYEAELSQVKEKIEQSLAYIREADELWAWF-NH2
1368 Ac-WQAWIEYEASLSQAKEKIEESKAYIREADELWAWF-NH2

13 69 AC - WQAWIEYERLLVQAKLKIAIAKLYIAKELLEWAWF -NH2 

1370 AC - WQAWIE YERLLVQVKLKIAIALLYIAKELLEWAWF-NH2 

1371 Ac - WQAWIELERLLVQVKLKLAIAKLE IAKELLEWAWF -NH2 

1372 AC -GEWTYDDATKTFTVTEGGH-NH2 

1373 AC - WQEWEQKIGEWTYDDATKTFTVTEGGHWASLWEWF -NH2 

1374 AC - GEWTYDDATKTFTVTE - NH2 

1375 Ac -WQEWEQKIGEWTYDDATKTFTVTEWASLWEWF-NH2 

1376 Ac -MHRFDYRT-NH2 

1377 Ac - WQEWEQKIMHRFD YRTWASL WE WF - NH2 
137 8 AC -MHRFNWSTGGG -NH2 

1379 AC - WQEWEQKIMHRFNWSTGGGWAS LWEWF - NH2 

13 8 0 Ac - MHRFNWS T -NH2 

1381 Ac - WQEWEQKIMHRFNWSTWASLWEWF -NH2 

1382 Ac -LLiVPIiARIMTMSSVHGGG-NH2 

13 83 Ac - WQEWEQKILLVPLARIMTMS S VHGGGWASLWEWF -NH2 

13 84 AC - LL VPLARIMTMS S VH - NH2 

13 85 AC-WQEWEQKILLVPLARIMTMSSVHWASLWEWF-NH2 

13 86 TALLEQAQIQQEKNEYELQKLDK 

13 87 AC -TALLEQAQIQQEKNEYELQKLDK-NH2 

13 88 AC-TALLEQAQIQQEKIEYELQKLIE-NH2 

1389 TALLEQAQIQQEKIEYELQKLIE 

13 90 AC -QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2 

13 91 Rhod - QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWG IKQLQARILAVERY -NH2 

1392 Ac - GAAS LTLS AQSRTLLAG I VQQQQQLLD WKRQQEML - NH2 

13 93 AC -GSAMGAASLTLSAQSRTLIAGIVQQQQQLLDVVKRQQEML-NH2 

13 94 Ac -PALSTGLIHLHQNIVDVQFLFGVGSSIASWAIKWEY-NH2 

13 95 Ac - PALSTGLIHLHQNIVDVQFLYGVGSS IASWAIK-NH2 

1396 Ac -LSTTQWQVLPUSFTTLPALSTGLIHLHQNIVDVQY-NH2 

1397 Ac - FRKFPEATFSRUGSGPRITPRUMVDFPFRLWHY-NH2 

13 98 Ac-DFPFRLWHFPUTINYTIFKVRLFVGGVEHRLEAAXJNWTR-NH2cf 

13 99 AC - YVGGVEHRLEAAUNWTRGERUDLEDRDRSELS PL -NH2 
1400 MVYPSDEYDASISQVNEEINQALAYIRKADELLENV 

14 02 Ac -GPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGG-NH2 
14 03 AC -LGPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLG-NH2 
14 04 AC-FLGPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFL-NH2 
14 05 Ac - YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF -NH2 
14 06 YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF 

14 07 Ac - YTGI I YNLLEESQNQQEKNEQELLELDKWANLWNWF -NH2 

1408 YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF 

1409 Ac - YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF -NH2 

1410 YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF 

1411 Ac -EKSQIQQEKNEQELLELDKWA-NH2 

1412 EKSQIQQEKNEQELLELDKWA 

1413 AC-EQAQIQQEKNEYELQKLDKWA-NH2 

1414 AC-YTSLIGSLIEESQIQQERNEQELLELDRWASLWEWF-NH2 

1415 Ac - YTXL IHSLIXESQNQQXKNEQELXEIiDKWASLWNWF -NH2 

14 16 AC-YTXLIHSLIWESQNQQXKNEQELXELD-NH2 

1417 Ac - YTSLIHSLIEESQNQQEKNEQELLELD-NH2 

1418 AC-WQEQEXKITALLXQAQIQQXKNEYELXKLDKWASLWEWF-NH2 
1419 Ac-XKITALLXQAQIQQXKNEYELXKLDKWASLWEWF-NH2

14 2 0 Ac - WQEWWXKITALLXQAQ IQQXKNE YELXKLD - NH2 

1421 Ac -WEQKITALLEQAQIQQEKNEYELQKLD-NH2 

1422 Ac - WEXKITALLXQAQIQQXKNEYELXKLD -NH2 
14 2 3 Ac -XKITALLXQAQ IQQXKNE YELXKLD -NH2 

1425 Ac - QKITALLEQAQIQQEKNEYELQKLD-NH2 

1426 Ac -QKITALLEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 

142 7 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLD -NH2 

1428 Ac -VYPSDEYDAS IS QVNEE INQALAYIRKADELLEN- OH 

1429 Ac-VYPSDEYDASISQVNEEINQALAYIRKADELLE-OH 
14 3 0 Ac - VYPSDE YDAS I SQVNEEINQALAYIRKADELL - OH 

1431 Ac - VYPSDE YDAS I S QVNEE INQALAYIRKADEL - OH 

1432 YPSDEYDAS IS QVNEE INQALAYIRKADELLENV-NH2 

1433 PSDEYDAS ISQVNEEINQALAYIRKADELLENV-NH2 
1434 SDEYDASISQVNEEINQALAYIRKADELLENV-NH2

1435 DEYDAS ISQVNEEINQALAYIRKADELLENV-NH2 

1436 AC - VYPSDEYDAS ISQVDEEINQALAYIRKADELLENV-NH2 

1437 Ac- VYPSDEYDAS ISQVNEEIDQALAYIRKADELLENV-NH2 

143 8 Ac - VYPSDEYDAS IS QVNEE INQALAYIRKADELLEDV-NH2 

143 9 Ac - VYPSDEYDAS ISQVDEEIDQALAYIRKADELLENV-NH2 

144 0 Ac - LLS TNKA WS LSNGVS VLTS KVLDLKNYIDKQLLP - NH2 
1441 AC -LSTNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPI -NH2 

1442 Ac-STNKAWSLSNGVSVGTSKVLDLKNYIDKQLLPIV-NH2

1443 AC -TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVN-NH2 

1444 Ac -NKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2 

1445 Ac - KAWSLSNGVSVLTSKVLDLKNYIDKQLLP IVNKQ -NH2 

1446 Ac - AWSLSNGVS VLTSKVLDLKNYIDKQLLPIVNKQS -NH2 

1447 AC-WSLSNGVSVLTSKVDLKNYIDKQWLLPIVNKQSU-NH2 
144 8 Ac -VSLSNGVSVLTS KVLDLKNYIDKQLLP I VNKQSUS -NH2 
1449 AC - SLSNGVSVLTSKVLDLKNYIDKQLLPI VNKQSUS I -NH2 

1450 Ac-LSNGVSVLTSKVLDKLKNYIDKQLLPIVNKQSUSIS-NH2

1451 Ac - SNGVSVLTSKVLDLKNYIDKQLLP IVNKQSUS ISN-NH2 

1452 Ac - NGVS VLTS KVLDLKNYIDKQLLP I VNKQSUS I SNI -NH2 

1453 ' Ac - GVS VLTS KVLDLKNYIDKQLLP IVNKQSUS ISNIE -NH2 

1454 Ac - VSVLTSKVLDLKNYIDKQLLPI VNKQSUS IS INIET-NH2 

1455 AC-SVLTSKVLDLKNYIDKQLLPIVNKQSUSISNIETV-NH2 

1456 Ac -VLTSKVIiDLKNYIDKQLLP IVNKQSUS ISNIETVI -NH2 

1457 AC - LTSKVLDLKNYIDKQLLP IVNKQSUS ISNIETVIE -NH2 
1458 Ac-TSKVLDLKNYIDKQLLPIVKQSUSISNIETVIEF-NH2

1459 Ac - S KVLDLKNYIDKQLLP IVNKQSUS ISNIETVIEFQ -NH2 

1460 AC - KVLDLKNYIDKQLLPIVNKQSUSISNIETVIEFQQ -NH2 

1461 AC-VLDLKNYIDKQLLPITOKQSUSISNIETVIEFQQK-NH2 

1462 Ac - LDLKNYIDKQLLP I VNKQSUS I SNIETVIEFQQKN -NH2 

1463 AC-DLKNYIDKQLLPIVNKQSUSISNIETVIEFQQKNN-NH2 
14 64 AC-LKNYIDKQLLPIVNKQSUSISNIETVIEFQQKNNR-NH2 
1465 Ac - KNYIDKQLLP IVNKQSUS ISNIETVIEFQQKNNRL-NH2 
14 66 Ac - NYIDKQLLP IVNKQSUS I SNIETVIEFQQKNNRLL-NH2 

1467 Ac -YIDKQLLP IVNKQSUS I SNIETVIEFQQKNNRLLE-NH2 

1468 Ac - IDKQLLP IVNKQSUS ISNIETVIEFQQKNNRLLE I -NH2 

1469 AC-DKQLLPIVNKQSUSISNIETVIEFQQKNNRLLEIT-NH2 
1470 Ac-KQLLPIVNKQSUSISNIETVIEFQQKNNRLLEITR-NH2

1471 AC-QLLPIVNKQSUSISNIETVIEFQQKNNRLLEITRE-NH2 

1472 Ac-VYPSDEYDASISQVNEEINQALA 

1473 QVNEEINQALAYIRKADELLENV-NH2 

1474 VYPSDEYDASISQVNEEINQALAYIRKADELLENV 

1475 Ac - DEYDAS I SQVNEE INQALAYIREADEL -NH2 

1476 Ac -DEYDAS ISQVNEKINQALAYIREADEL -NH2 

1477 Ac -DDECLNSVKNGTYDPPKFEEESKLNRNEIKGVKLS -NH2 

1478 Ac -DDE-Abu-LNSVKNGTYDFPKPEEESKLNRNEIKGVKLS -NH2 

1479 Ac - YHKCDDECLNSVKNGTFDFPKFEEESKLNRNEIKGVKLSS -NH2 

1480 Ac-YHK-Abu-DDE-Abu-LNSVKNGTFDFPKFEEESKLNRNEIKGVKLSS- 
1481 Ac-YTSLIHSLIEESQIQQEKNEQELLELDKWASLWNWF-NH2

1482 Ac-YTSLIHSLIEESQNQQEKNEYELLELDKWASIiWNWF-NH2 

14 83 Ac ~ YTSLIHSLIEESQIQQEKNEYELLELDKWASLWNWF -NH2 

14 84 Ac " YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWNWF-NH2 

1485 Ac-YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWNWF-NH2

14 86 Ac - YTSLIHSLIEESQNQQEKNE YELQKLDKWASLWNWF -NH2 

14 87 AC-YTSLIHSLIEESQIQQEKNEQELQKLDKWASLWNWF-NH2 

14 88 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWEWF-NH2 

14 89 Ac " YTSLIHSLIEESQIQQEKNEQELLELDKWASLWEWF -NH2 

14 90 Ac - YTSLIHSLIEESQNQQEKNEYELLELDKWASLWEWF-NH2 

14 91 Ac - YTSLIHSLIEESQIQQEKNEYELLELDKWASLWEWF-NH2 

14 92 AC -YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWEWF-NH2 

1493 Ac - YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWEWF -NH2 

14 94 AC ~ YTSLIHSLIEESQNQQEKNEYELQKLDKWASLWEWF-NH2 

1495 AC-YTSLIHSLIEESQIQQEKNEQELQKLDKWASLWEWF-NH2 

1496 Ac - WQEQEQKITALLEQAQIQQEKNEYELQKLDKEWWF-NH2 

1497 Ac -WQEWEQKITALLEQAQIQQEKNEYELQKLIEWASLWEWF-NH2 

14 98 AC -WQEWEQKITALLEQAQIQQEKNEYELQKLAKWASLWEWF-NH2 

1499 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWASLWEWF-NH2 

1500 Ac -WQEWEQKITALLEQAQIQQEKNEYELQKLIEWAGLWEWF-NH2 

1501 Ac - WQEWEQKITALLEQAQI QQEKNEYELQKLAKWAGLWEWF -NH2 

1502 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWAGLWEWF-NH2 

1503 AC -WQEWEQKITALLEQAQIQQEKNEYELQKLIEWAGLWAWF-NH2 

1504 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLAKWAGLWAWF -NH2 

1505 AC "WQEWEQKITALLEQAQIQQEKNEYELQKLIKWAGLWAWF-NH2 

1506 Ac - WQEWEQKITALLEQAQIQQEKGEYELQKLDKQEQF -NH2 

1507 AC-WQEWEQKITALLEQAQIQQEKGEYELLELDKWEWF-NH2 

15 08 Ac - WQEWEQKITALLEQAQIQQEKGEYELQKLAKWEWF-NH2 

1509 AC-WQEWEQKITALLEQAQIQQEKGEYELQKLDWQWEF-NH2 

1510 AC - WQEWEQKITALLEQAQIQQEKGEYELLELAKWEWF -NH2 

1511 Ac - WEQWEQKITALLEQAQIQQEKNEYELLELDKWEWF-NH2 

1512 Ac -WQEWEQKITALLEQAQIQQEKNEYELEEELIEWASLWEWF-NH2 

1513 Ac - WQEWEQKITALLEQAQIQQEKNEYELLELIEWAGLWEWF -NH2 

1514 AC-WQEWEQKITALLEQAQIQQEKNEYELLELIEWAGLWAWF-NH2 

1515 Ac - WQEWEREITALLEQAQIQQEKNEYELQKLIEWASLWEWF-NH2 

1516 Ac - WQEWERE I QQEKNEYELQKLDKWASLWEWF -NH2 

1517 Ac - WQEWERE I QQEKGEYELQKLIEWEWF-NH2 

1518 Ac - WQEWQAQIQQEKNEYELQKLDKWASLWEWF -NH2 

1519 Ac - WQE WQAQ IQQEKGEYELQKL IEWEWF -NH2 
1520 PEG-GWQEWEQRITALLEQAQIQQERNEYELQRLDEWASLWEWF-NH2

1521 Ac - GWQEWEQRITALLEQAQ IQQERNE YELQRLDE WAS LWEWF -NH2 

1522 PEG-YTSLITALLEQAQIQQERNEQELLELDEWASLWEWF-NH2 

1523 AC-YTSLITALLEQAQIQQERNEQELLELDEWASLWEWF-NH2 

1526 PEG-GWQEWEQRITALLEQAQIQQERNEYELQELDEWASLWEWF-NH2 

1527 Ac - GWQEWEQRITALLEQAQIQQERNEYELQELDEWASLWEWF -NH2 

1528 PEG-YTSLIGSLIEESQIQQERNEQELLELDRWASLWEWF-NH2

1529 PEG-GWQEWEQRITALLEQAQIQQERHEYELQRLDRWASLWEWF-NH2

1530 AC-GWQEWEQRITALLEQAQIQQER1TEYELQRLDRWASLWEWF-NH2 

1531 PEG - GWQEWEQRITALLEQAQIQQERNEYELQELDRWASLWEWF -NH2 

1532 AC - GWQEWEQRITALLEQAQIQQER1IEYELQELDRWASLWEWF -NH2 

1533 PEG- YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 

1534 AC-YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 
1538 Ac-YTSLIHSLIEESQNQQEK-OH 

1539 NEQELLELDK

154 0 WASLWNWF-NH2 

1542 AC -AAAWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 
154 3 AC - WQEAAAKITALLEQAQ IQQEKNEYELQKLDKWAS LWEWF - NH2 

1544 AC-WQEWEQAAAALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1545 AC-WQEWEQKITAAAEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1546 Ac -WQEWEQKITALLAAAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1547 Ac -WQEWEQKITALLEQAAAAQEKNEYELQKLDKWASLWEWF-NH2 
1548 Ac-WQEWEQKITALLEQAQIQAAAHEYELQKLDKWASLWEWF-NH2

1549 AC -WQEWEQKITALLEQAQIQQEKAAAELQKLDKWASLWEWF-NH2 

1550 AC -WQEWEQKITALLEQAQIQQEKNEYAAAKLDKWASLWEWF-NH2 

1551 AC-WQEWEQKITALLEQAQIQQEKNEYELQAAAKWASLWEWF-NH2 

1552 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLDAAASLWEWF-NH 

1553 AC - WQEWEQKITALLEQAQ IQQEKNE YELQKLDKWAAAAEWF -KH 

1554 Ac - WQEWEQKITALLEQAQ IQQEKNEYELQKLDKWAS LWAAA - NH 
1556 AC - YTSLIHSLIEESQNQQEKNEQELLLDKWASLWNWF-NH2 

1557 Ac-YTSLIHSLIEESQNQEKNEQELLELDKWASLWNWF-NH2

1558 Ac -ERTLDFHDS -NH2 

1559 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN(W) F-NH2 

1563 Ac-YTSLIHSLIEESQN (Q) QEKNEQELLELDKWASLWNWF-NH2 

1564 Ac - YTSLIHSLIEESQNQQDKWASLWNWF -NH2 

1566 Ac - FYEI IMD IEQNNVQGKKGIQQLQKWEDWVGWIGNI -NH2 

1567 Ac - INQTIWNHGNITLGEWYNQTKDLQQKFYEIIMDIE-NH2 

1568 Ac - WNHGNITLGEWYNQTKDLQQKF YE I IMDI EQNNVQ -NH2 
1572 Ac-YTSLIHSLIEESENQQEKNEQELLELDKWASLWNWF-NH2

1573 AC-YTSLIHSLIEESQDQQEKNEQELLELDKWASLWNWF-NH2 

1574 Ac - YTSLIHSLIEESQNEQEKNEQELLELDKWASLWNWF-NH2 

1575 C - YTSLIHSLIEESQNQEEKNEQELLELDKWASLWNWF-NH2 

1576 Ac-YTSLIHSLIEESQNQQEKDEQELLELDKWASLWNWF-NH2

1577 AC-LGEWYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQ-NH2 

1578 AC-WYNQTKDLQQKFYEIIMDIBQNKVQGKKGIQQLQK-NH2 

1579 AC-YTSLIHSLIEESQNQQEKNEEELLELDKWASLWNWF-NH2 

1580 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWDWF-NH2 
1586 AC-XTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWX-NH2 
1588 Ac - YNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQKW-NH2 
1598 Ac- YTSL IHS L IEES QNQQEKNEQELLELDKWASLWNWF 
1600 Ac-TLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR-NH2

1603 AC-LQQKFYEIIMDIEQNNVQGKKGIQQLQKWEDWVGW-NH2 

1627 AC-YTSLIHSLIEESQNQQEKNEQELLALDKWASLWNWF-NH2 

162 8 AC-YTSLIHSLIEESQNQQEKNEQELLEADKWASLWNWF-NH2 

162 9 AC-YTSLIHSLIEESQNQQEKNEQELLELAKWASLWNWF-NH2 

163 0 Ac - YTSLIHSLIEESQNQQEKAEQELLELDKWASLWNWF -NH2 

1631 AC-YTSLIHSLIEESQNQQEKNAQELLELDKWASLWNWF-NH2 

1632 AC-YTSLIHSLIEESQNQQEKNEAELLELDKWASLWNWF-NH2 

1634 AC-WQEWEQKITALLEQAQIQQEKNEQELQKLDKWASLWEWF-NH2 

1635 AC -WQEWEQKITALLEQAQIQQEKAEYELQKLDKWASLWEWF -NH2 

163 6 Ac - WQEWEQKITALLEQAQIQQEKNAYELQKLDKWASLWEWF -NH2 
1637 Ac - WQEWEQKITALLEQAQIQQEKNEAELQKLDKWASLWEWF -NH2 

1644 Ac -EYDLRRWEK-NH2 

1645 AC - EQELLELDK-NH2 

164 6 AC - EYELQKLDK-NH2 

164 7 AC-WQEWEQKITALLEQAQIQQEKNEQELLKLDKWASLWEWF-NH2 

1648 AC-WQEWEQKITALLEQAQIQQEKNEQELLELDKWASLWEWF-NH2 

164 9 AC-WQEWEQKITALLEQAQIQQEKNDKWASLWEWF-NH2 

165 0 Ac - YTSLIHSLIEESQNQAEKNEQELLELDKWASLWNWF-NH2 

1651 AC-YTSLIHSLIEESQNQQAKNEQELLELDKWASLWNWF-NH2 

1652 AC-YTSLIHSLIEESQNQQEANEQELLELDKWASLWNWF-NH2 

1653 Ac - YTSLIHSLIEESANQQEANEQEIiLELDKWASLWNWF - NH2 

1654 AC-YTSLIHSLIEESQAQQEKNEQELLELDKWASLWNWF-NH2 

1655 Ac - YTSLIHSLIEESQNAQEKNEQELLELDKWASLWNWF -NH2 

1656 Ac-YTSLIHALIEESQNQQEKNEQELLEtiDKWASLWNWF-NH2 

1657 AC-YTSLIHSAIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1658 Ac - VYPSDE YDAS ISQVNEE INQALAYIRKADELLENV- NH2 
165 9 AC-YTSLIHSLAEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1660 Ac - YTSAIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1661 AC - YTSLAHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1662 Ac - YTSLIASLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1663 Ac - ATSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1664 AC-YASLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1665 Ac - YTALIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1666 Ac - RIQDLEKYVEDTKIDLWS YNAELLVALENQ - NH2 

1667 Ac -HTIDLTDSEMNKLFEKTRRQLREN-NH2 

1668 Ac-SEMNKLFEKTRRQLREN -NH2 

1669 AC-VFPSDEADASISQVNEKINQSLAFIRKSDELLHNV-NH2 

1670 AC-VFPSDEFAASISQVNEKINQSLAFIRKSDELLHNV-NH2 

1671 Ac - VFPSDEFDASISAVNEKINQSLAFIRKSDELLHNV-NH2 

1672 AC-VFPSDEFDASISQANEKINQSLAFIRKSDELLHNV-NH2 

1673 AC-VFPSDEFDASISQVAEKINQSLAFIRKSDELLHNV-NH2 

1674 Ac - WQEWEQKITAALEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 

1675 AC -WQEWEQKITALAEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1676 Ac - WQEWEQKITALLEQAAIQQEKNEYELQKLDKWASLWEWF -NH2 

1677 Ac - WQEWEQKITALLEQAQAQQEKNE YELQKLDKWASLWEWF -NH2 

1678 Ac - WQEWEQKITALLEQAQIAQEKNEYELQKLDKWASLWEWF-NH2 

1679 Ac - WQEWEQKITALLEQAQIQAEKHEYELQKLDKWASLWEWF -NH2 
168 0 Ac - VFPSDEFDAS ISQVNEKINQSAAFIRKSDELLHNV-NH2 
1681 Ac-VFPSDEFDASISQVNEKINQSIiAAIRKSDELIiHNV-NH2 
1682 Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDEALHNV-NH2

168 3 Ac - VFPSDEFDASISQVNEKINQSLAFIRKSDEIAHNV-NH2 

1684 AC - VFPSDEFDAS ISQVNEKINQSLAFIRKSDELLANV-NH2 

1685 Ac -WQEWEQKITALLEQAQIQQAKNEYELQKLDKWASLWEWF-NH2 

1687 Ac -WQEWEQKITALLEQAQIQQEKNEYELQALDKWASLWEWF-NH2 

1688 Ac -WQEWEQKITALLEQAQIQffeKNEYELQKADKWASLWEWF-NH2 



5.4. SYNTHESIS OF PEPTIDES
The peptides of the invention may be synthesized 
or prepared by techniques well known in the art. See, 
for example, Creighton, 1983, Proteins: Structures 
and Molecular Principles, W.H. Freeman and Co., NY, 
which is incorporated herein by reference in its
entirety. Short peptides, for example, can be
synthesized on a solid support or in solution. Longer
peptides may be made using recombinant DNA techniques.
Here, the nucleotide sequences encoding the peptides
of the invention may be synthesized, and/or cloned,
and expressed according to techniques well known to
those of ordinary skill in the art. See, for example,
Sambrook, et al., 1989, Molecular Cloning, A
Laboratory Manual, Vols. 1-3, Cold Spring Harbor 
Press, NY. 

The peptides of the invention may alternatively
be synthesized such that one or more of the bonds
which link the amino acid residues of the peptides are
non-peptide bonds. These alternative non-peptide
bonds may be formed by utilizing reactions well known
to those in the art, and may include, but are not
limited to, imino, ester, hydrazide, semicarbazide, and
azo bonds, to name but a few. In yet another
embodiment of the invention, peptides comprising the
sequences described above may be synthesized with
additional chemical groups present at their amino
and/or carboxy termini, such that, for example, the
stability, bioavailability, and/or inhibitory activity
of the peptides is enhanced. For example, hydrophobic
groups such as carbobenzoxyl, dansyl, or
t-butyloxycarbonyl groups, may be added to the peptides'
amino termini. Likewise, an acetyl group or a
9-fluorenylmethoxy-carbonyl group may be placed at the
peptides' amino termini. (See "X" in Tables I to IV,
above.) Additionally, the hydrophobic group,
t-butyloxycarbonyl, or an amido group may be added to
the peptides' carboxy termini. (See "Z" in Tables I
to IV, above.)

Further, the peptides of the invention may be
synthesized such that their steric configuration is
altered. For example, the D-isomer of one or more of
the amino acid residues of the peptide may be used,
rather than the usual L-isomer.

Still further, at least one of the amino acid
residues of the peptides of the invention may be
substituted by one of the well known non-naturally
occurring amino acid residues. Alterations such as
these may serve to increase the stability,
bioavailability and/or inhibitory action of the
peptides of the invention.

Any of the peptides described above may,
additionally, have a macromolecular carrier group
covalently attached to their amino and/or carboxy
termini. Such macromolecular carrier groups may
include, for example, lipid-fatty acid conjugates,
polyethylene glycol, carbohydrates or additional
peptides. "X", in Tables I to IV, above, may
therefore additionally represent any of the above
macromolecular carrier groups covalently attached to
the amino terminus of a peptide, with an additional
peptide group being preferred. Likewise, "Z", in
Tables I to IV, may additionally represent any of the
macromolecular carrier groups described above.

5.5. ASSAYS FOR ANTI-MEMBRANE FUSION ACTIVITY

Described herein are methods for assaying the ability of a
compound, such as the peptides of the invention, to
inhibit membrane fusion events. Specifically, assays
for cell fusion events are described in Section 5.5.1, 
below, and assays for antiviral activity are described 
in Section 5.5.2, below. 

5.5.1 ASSAYS FOR CELL FUSION EVENTS

Assays for cell fusion events are well known to 
those of skill in the art, and may be used in 
conjunction, for example, with the peptides of the 
invention to test the peptides' antifusogenic
capabilities.

Cell fusion assays are generally performed in 
vitro. Such an assay may comprise culturing cells 
which, in the absence of any treatment, would undergo
an observable level of syncytial formation. For
example, uninfected cells may be incubated in the
presence of cells chronically infected with a virus 
that induces cell fusion. Such viruses may include, 
but are not limited to, HIV, SIV, or respiratory 
syncytial virus. 

For the assay, cells are incubated in the 
presence of a peptide to be assayed. For each
peptide, a range of peptide concentrations may be
tested. This range should include a control culture 
wherein no peptide has been added. 

Standard conditions for culturing cells, well 
known to those of ordinary skill in the art, are used.
After incubation for an appropriate period (24 hours
at 37°C, for example) the culture is examined
microscopically for the presence of multinucleated
giant cells, which are indicative of cell fusion and
syncytial formation. Well known stains, such as
crystal violet stain, may be used to facilitate the
visualization of syncytial formation.

5.5.2 ASSAYS FOR ANTIVIRAL ACTIVITY
The antiviral activity exhibited by the peptides 
of the invention may be measured, for example, by 
easily performed in vitro assays, such as those 
described below, which can test the peptides' ability 
to inhibit syncytia formation, or their ability to
inhibit infection by cell-free virus. Using these
assays, such parameters as the relative antiviral
activity the peptides exhibit against a given
strain of virus and/or the strain-specific inhibitory
activity of the peptide can be determined.

A cell fusion assay may be utilized to test the 
peptides' ability to inhibit viral-induced, such as
HIV-induced, syncytia formation in vitro. Such an
assay may comprise culturing uninfected cells in the
presence of cells chronically infected with a
syncytial-inducing virus and a peptide to be assayed.
For each peptide, a range of peptide concentrations
may be tested. This range should include a control
culture wherein no peptide has been added. Standard
conditions for culturing, well known to those of
ordinary skill in the art, are used. After incubation
for an appropriate period (24 hours at 37°C, for
example) the culture is examined microscopically for
the presence of multinucleated giant cells, which are
indicative of cell fusion and syncytia formation.
Well known stains, such as crystal violet stain, may
be used to facilitate syncytial visualization. Taking
HIV as an example, such an assay would comprise CD-4+
cells (such as Molt or CEM cells, for example)
cultured in the presence of chronically HIV-infected
cells and a peptide to be assayed.
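
By way of illustration only, the comparison of peptide-treated cultures with the no-peptide control culture may be reduced to a percent-inhibition figure for each peptide concentration. The following is a minimal sketch; the function name, the concentration units and the syncytia counts are hypothetical and are not data from this specification.

```python
def percent_inhibition(control_count, treated_count):
    """Percent reduction in syncytia relative to the no-peptide control culture."""
    if control_count <= 0:
        raise ValueError("control culture must show an observable level of syncytia")
    return 100.0 * (control_count - treated_count) / control_count


# Hypothetical syncytia counts per well (illustrative values only).
control = 80                                   # no-peptide control culture
counts_by_conc = {0.1: 72, 1.0: 40, 10.0: 4}   # peptide concentration -> syncytia count

inhibition = {conc: percent_inhibition(control, n) for conc, n in counts_by_conc.items()}
```

A table of this kind, built over the range of peptide concentrations tested, is one simple way to compare the relative activity of different peptides against a given strain.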

Other well known characteristics of viral 
infection may also be assayed to test a peptide's
antiviral capabilities. Once again taking HIV as an
example, a reverse transcriptase (RT) assay may be
utilized to test the peptides' ability to inhibit
infection of CD-4+ cells by cell-free HIV. Such an
assay may comprise culturing an appropriate
concentration (i.e., TCID50) of virus and CD-4+ cells
in the presence of the peptide to be tested. Culture
conditions well known to those in the art are used.
As above, a range of peptide concentrations may be
used, in addition to a control culture wherein no
peptide has been added. After incubation for an
appropriate period (e.g., 7 days) of culturing, a
cell-free supernatant is prepared, using standard
procedures, and tested for the presence of RT activity
as a measure of successful infection. The RT activity
may be tested using standard techniques such as those
described by, for example, Goff et al. (Goff, S. et
al., 1981, J. Virol. 38:239-248) and/or Willey et al.
(Willey, R. et al., 1988, J. Virol. 62:139-147).
These references are incorporated herein by reference 
in their entirety. 

Standard methods which are well known to those of
skill in the art may be utilized for assaying
non-retroviral activity. See, for example, Pringle et al.
(Pringle, C.R. et al., 1985, J. Medical Virology
17:377-386) for a discussion of respiratory syncytial
virus and parainfluenza virus activity assay
techniques. Further, see, for example, "Zinsser
Microbiology", 1988, Joklik, W.K. et al., eds.,
Appleton & Lange, Norwalk, CT, 19th ed., for a general
review of such techniques. These references are
incorporated by reference herein in their entirety.
In addition, the Examples presented below, in Sections
17, 18, 26 and 27 each provide additional assays for
the testing of a compound's antiviral capability. 

In vivo assays may also be utilized to test, for 
example, the antiviral activity of the peptides of the 
invention. To test for anti-HIV activity, for
example, the in vivo model described in Barnett et al.
(Barnett, S.W. et al., 1994, Science 266:642-646) may 
be used. 

Additionally, anti-RSV activity can be assayed in 
vivo via well known mouse models. For example, RSV
can be administered intranasally to mice of various
inbred strains. Virus replicates in lungs of all
strains, but the highest titers are obtained in P/N,
C57L/N and DBA/2N mice. Infection of BALB/c mice
produces an asymptomatic bronchiolitis characterized
by lymphocytic infiltrates and pulmonary virus titers
of 10^4 to 10^5 pfu/g of lung tissue (Taylor, G. et al.,
1984, Infect. Immun. 42:649-655). 

Cotton rat models of RSV are also well known. 
Virus replicates to high titer in the nose and lungs 
of the cotton rat but produces few if any signs of
inflammation.

5.6. USES OF THE PEPTIDES OF THE INVENTION
The peptides of the invention may be utilized as 
antifusogenic or antiviral compounds, or as compounds
which modulate intracellular processes involving
coiled-coil peptide structures. Further, such
peptides may be used to identify agents which exhibit
antifusogenic, antiviral or intracellular modulatory
activity. Still further, the peptides of the
invention may be utilized as organism or viral
type/subtype-specific diagnostic tools.

The antifusogenic capability of the peptides of
the invention may additionally be utilized to inhibit 
or treat/ameliorate symptoms caused by processes 
involving membrane fusion events. Such events may 
include, for example, virus transmission via cell-cell
fusion, abnormal neurotransmitter exchange via
cell-fusion, and sperm-egg fusion. Further, the peptides
of the invention may be used to inhibit free viral,
such as retroviral, particularly HIV, transmission to
uninfected cells wherein such viral infection involves
membrane fusion events or involves fusion of a viral
structure with a cell membrane. Among the
intracellular disorders involving coiled-coil peptide
structures which may be ameliorated by the peptides of
the invention are disorders involving, for example, 
bacterial toxins. 

With respect to antiviral activity, the viruses 
whose transmission may be inhibited by the peptides of 
the invention include, but are not limited to, human
retroviruses, such as HIV-1 and HIV-2 and the human
T-lymphocyte viruses (HTLV-I and II), and non-human
retroviruses such as bovine leukosis virus, feline
sarcoma and leukemia viruses, simian immunodeficiency,
sarcoma and leukemia viruses, and sheep progressive
pneumonia viruses.

Non-retroviral viruses whose transmission may be
inhibited by the peptides of the invention include,
but are not limited to, human respiratory syncytial
virus, canine distemper virus, Newcastle disease
virus, human parainfluenza virus, influenza viruses,
measles viruses, Epstein-Barr viruses, hepatitis B
viruses, and simian Mason-Pfizer viruses.

Non-enveloped viruses whose transmission may be
inhibited by the peptides of the invention include,
but are not limited to, picornaviruses such as polio
viruses, hepatitis A virus, enterovirus, echoviruses 
and coxsackie viruses, papovaviruses such as papilloma 
virus, parvoviruses, adenoviruses and reoviruses. 

As discussed more fully below, in Section 5.6.1
and in the Example presented below in Section 8,
DP107, DP178, DP107 analog and DP178 analog peptides
form non-covalent protein-protein interactions which
are required for normal activity of the virus. Thus,
the peptides of the invention may also be utilized as
components in assays for the identification of
compounds that interfere with such protein-protein
interactions and may, therefore, act as antiviral 
agents. These assays are discussed, below, in Section 
5.6.1. 

As demonstrated in the Example presented below in
Section 6, the antiviral activity of the peptides of
the invention may show a pronounced type and subtype
specificity, i.e., specific peptides may be effective
in inhibiting the activity of only specific viruses. 
This feature of the invention presents many
advantages. One such advantage, for example, lies in
the field of diagnostics, wherein one can use the
antiviral specificity of the peptide of the invention
to ascertain the identity of a viral isolate. With
respect to HIV, one may easily determine whether a
viral isolate consists of an HIV-1 or HIV-2 strain.
For example, uninfected CD-4+ cells may be co-infected
with an isolate which has been identified as
containing HIV and the DP178 (SEQ ID NO:1) peptide, after
which the retroviral activity of cell supernatants may
be assayed, using, for example, the techniques
described above in Section 5.2. Those isolates whose
retroviral activity is completely or nearly completely
inhibited contain HIV-1. Those isolates whose viral
activity is unchanged or only reduced by a small
amount may be considered to not contain HIV-1. Such
an isolate may then be treated with one or more of the
other DP178 peptides of the invention, and
subsequently be tested for its viral activity in order
to determine the identity of the viral isolate. The
DP107 and DP178 analogs of the invention may also be
utilized in a diagnostic capacity specific to the type
and subtype of virus or organism in which the specific
peptide sequence is found. A diagnostic procedure as
described, above, for DP178, may be used in 
conjunction with the DP107/DP178 analog of interest. 

5.6.1. SCREENING ASSAYS 
As demonstrated in the Example presented in 
Section 8, below, DP107 and DP178 portions of the TM 
protein gp41, i.e., the HR1 and HR2 portions of gp41, 
respectively, form non-covalent protein-protein 
interactions. As is also demonstrated, the 
maintenance of such interactions is necessary for
normal viral infectivity. Thus, compounds which bind
DP107, bind DP178, and/or act to disrupt normal
DP107/DP178 protein-protein interactions may act as
antifusogenic, antiviral or cellular modulatory
agents. Described below are assays for the
identification of such compounds. Note that, while,
for ease and clarity of discussion, DP107 and DP178
peptides will be used as components of the assays
described, it is to be understood that any of the
DP107 analog or DP178 analog peptides described,
above, in Sections 5.1 through 5.3 may also be
utilized as part of these screens for compounds.

For example, in certain embodiments the assays of
the invention may use DP107 and/or DP178 analogs
that contain one or more amino acid residue
truncations, deletions, insertions or substitutions.
In particular, in one preferred embodiment, the DP107,
DP178, DP107-like and DP178-like peptides can comprise
amino- and/or carboxy-terminal insertions corresponding
to about two to about fifty amino acids amino-to or
carboxy-to the endogenous sequence from which the
DP107, DP178, DP107-like or DP178-like peptide is
derived. In another particular embodiment, the
peptides used in the assays described herein further
comprise additional, heterologous sequence useful for
detecting, immobilizing and/or purifying the
particular peptide. Such heterologous sequences
include, but are not limited to, maltose binding fusion
proteins containing a DP178, DP107, DP178-like or
DP107-like sequence such as the M41Δ178 and MF5.1
maltose binding fusion proteins described in Sections 
8 and 30, below. 

In certain embodiments, such analogs will have
reduced binding affinities and are therefore useful,
e.g., to screen for compounds which inhibit the
formation of or, alternatively, disrupt
DP107/DP178 complexes. Among such reduced
binding analogs are peptides exhibiting one or more
alanine insertions or substitutions, including, e.g.,
the peptides described in the examples presented in
Sections 30 and 31, below. It is understood that such
analogs which have reduced binding affinities,
including the analogs described in Sections 30 and 31
below, are also part of the present invention.

Compounds which may be tested for an ability to
bind DP107, DP178, and/or disrupt DP107/DP178 
interactions, and which, therefore, potentially
represent antifusogenic, antiviral or intracellular
modulatory compounds, include, but are not limited to,
peptides made of D- and/or L-configuration amino acids
(in, for example, the form of random peptide
libraries; see Lam, K.S. et al., 1991, Nature
354:82-84), phosphopeptides (in, for example, the form of
random or partially degenerate, directed
phosphopeptide libraries; see, for example, Songyang,
Z. et al., 1993, Cell 72:767-778), antibodies, and
small organic or inorganic molecules. Synthetic 
compounds, natural products, and other sources of 
potentially effective materials may be screened in a 
variety of ways, as described in this Section. 

Compounds that can be screened, tested and 
identified as modulating HR1/HR2, DP178/DP107 and/or 
DP178-like/DP107-like interactions utilizing the 
methods described herein can, in general, include,
e.g., small molecules that are of a molecular weight
up to about 1500 daltons. Test compounds, including
small molecules, can include, but are not limited to,
compounds obtained from any commercial source,
including Aldrich (1001 West St. Paul Ave., Milwaukee,
WI 53233), Sigma Chemical (P.O. Box 14508, St. Louis,
MO 63178), Fluka Chemie AG (Industriestrasse 25,
CH-9471 Buchs, Switzerland (Fluka Chemical Corp., 980
South 2nd Street, Ronkonkoma, NY 11779)), Eastman
Chemical Company, Fine Chemicals (P.O. Box 431,
Kingsport, TN 37662), Boehringer Mannheim GmbH
(Sandhofer Strasse 116, D-68298 Mannheim), Takasago (4
Volvo Drive, Rockleigh, NJ 07647), SST Corporation
(635 Brighton Road, Clifton, NJ 07012), Ferro (111
West Irene Road, Zachary, LA 70791), Riedel-deHaen
Aktiengesellschaft (P.O. Box D-30918, Seelze,
Germany), and PPG Industries Inc., Fine Chemicals (One PPG
Place, 34th Floor, Pittsburgh, PA 15272). Further, any
kind of natural products may be screened using the 
methods of the invention, including microbial, fungal 
or plant extracts. 

Furthermore, diversity libraries of test 
compounds, including small molecule test compounds, 
may be commercially obtained from Specs and BioSpecs
B.V. (Rijswijk, The Netherlands), Chembridge
Corporation (San Diego, CA), Contract Service Company
(Dolgoprudny, Moscow Region, Russia), Comgenex USA
Inc. (Princeton, NJ), Maybridge Chemicals Ltd.
(Cornwall PL34 OHW, United Kingdom), and Asinex
(Moscow, Russia). Combinatorial libraries of test
compounds, including small molecule test compounds,
may be generated as disclosed in Eichler &
Houghten, 1995, Mol. Med. Today 1:174-180; Dolle,
1997, Mol. Divers. 2:223-236; and Lam, 1997, Anticancer
Drug Des. 12:145-167. These references are
incorporated herein by reference in their entirety.
It is to be noted that such references also teach 
additional screening methods which may be employed for 
the further testing of compounds identified via the 
methods of the invention and which can aid in 
identifying and isolating compounds which can 
represent leads and therapeutic compounds.

The compounds, antibodies, or other molecules
identified may be tested, for example, for an ability 
to inhibit cell fusion or viral activity, utilizing, 
for example, assays such as those described, above, in 
Section 5.5. 

Among the peptides which may be tested are
soluble peptides comprising DP107 and/or DP178
domains, and peptides comprising DP107 and/or DP178
domains having one or more mutations within one or
both of the domains, such as the M41-P peptide
described, below, in the Example presented in Section
8, which contains an isoleucine to proline mutation
within the DP178 sequence. 

One embodiment of such screening methods is a
method for identifying a compound to be tested for
antiviral ability comprising:

(a) exposing at least one compound to a
peptide comprising a DP107 peptide for a time
sufficient to allow binding of the compound to the
DP107 peptide;

(b) removing non-bound compounds; and

(c) determining the presence of the
compound bound to the DP107 peptide,
thereby identifying an agent to be tested for
antiviral ability.

A second embodiment of such screening methods
is a method for identifying a compound to be tested
for antiviral ability comprising:

(a) exposing at least one compound to a
peptide comprising a DP178 peptide for a time
sufficient to allow binding of the compound to the
DP178 peptide;

(b) removing non-bound compounds; and

(c) determining the presence of the
compound bound to the DP178 peptide, 
thereby identifying an agent to be tested for 
antiviral ability. 

One method utilizing these types of approaches 
that may be pursued in the isolation of such
DP107-binding or DP178-binding compounds is an assay which
would include the attachment of either the DP107 or
the DP178 peptide to a solid matrix, such as, for
example, agarose or plastic beads, microtiter plate
wells, petri dishes, or membranes composed of, for
example, nylon or nitrocellulose. In such an assay
system, either the DP107 or DP178 protein may be
anchored onto a solid surface, and the compound, or
test substance, which is not anchored, is labeled,
either directly or indirectly (e.g., with a
radioactive label such as 125I, an absorption label
such as biotin, or a fluorescent label such as
fluorescein or rhodamine). In practice, microtiter
plates are conveniently utilized. The anchored
component may be immobilized by non-covalent or
covalent attachments. Non-covalent attachment may be
accomplished simply by coating the solid surface with
a solution of the protein and drying. Alternatively,
an immobilized antibody, preferably a monoclonal
antibody, specific for the protein may be used to
anchor the protein to the solid surface. The surfaces
may be prepared in advance and stored. 

In order to conduct the assay, the labeled 
compound is added to the coated surface containing the 
anchored DP107 or DP178 peptide. After the reaction
is complete, unreacted components are removed (e.g.,
by washing) under conditions such that any complexes
formed will remain immobilized on the solid surface.
The detection of complexes anchored on the solid
surface can be accomplished in a number of ways.
Where the compound is pre-labeled, the detection of
label immobilized on the surface indicates that
complexes were formed. Where the compound is
not pre-labeled, an indirect label can be used to
detect complexes anchored on the surface; e.g., using
a labeled antibody specific for the compound (the
antibody, in turn, may be directly labeled or
indirectly labeled with a labeled anti-Ig antibody).

Alternatively, such an assay can be conducted in 
a liquid phase, the reaction products separated from 
unreacted components, and complexes detected; e.g.,
using an immobilized antibody specific for DP107 or
DP178, whichever is appropriate for the given assay,
or an antibody specific for the compound, i.e., the
test substance, in order to anchor any complexes
formed in solution, and a labeled antibody specific
for the other member of the complex to detect anchored
complexes.

By utilizing procedures such as this, large
numbers of types of molecules may be simultaneously
screened for DP107- or DP178-binding capability, and
thus potential antiviral activity. 

Further, compounds may be screened for an ability
to inhibit the formation of or, alternatively, disrupt
DP107/DP178 complexes. Such compounds may then be
tested for antifusogenic, antiviral or intracellular
modulatory capability. For ease of description, DP107
and DP178 will be referred to as "binding partners."
Compounds that disrupt such interactions may exhibit
antiviral activity. Such compounds may include, but
are not limited to, molecules such as antibodies,
peptides, and the like described above. 

The basic principle of the assay systems used to 
identify compounds that interfere with the interaction 
between the DP107 and DP178 peptides involves
preparing a reaction mixture containing the peptides under
conditions and for a time sufficient to allow the two
peptides to interact and bind, thus forming a complex.
In order to test a compound for disruptive activity,
the reaction is conducted in the presence and absence
of the test compound, i.e., the test compound may be
initially included in the reaction mixture, or added
at a time subsequent to the addition of one of the
binding partners; controls are incubated without the
test compound or with a placebo. The formation of any
complexes between the binding partners is then
detected. The formation of a complex in the control
reaction, but not in the reaction mixture containing
the test compound, indicates that the compound
interferes with the interaction of the DP107 and DP178
peptides.

The assay for compounds that interfere with the
interaction of the binding partners can be conducted
in a heterogeneous or homogeneous format.
Heterogeneous assays involve anchoring one of the
binding partners onto a solid phase and detecting
complexes anchored on the solid phase at the end of
the reaction. In homogeneous assays, the entire
reaction is carried out in a liquid phase. In either
approach, the order of addition of reactants can be
varied to obtain different information about the
compounds being tested. For example, test compounds
that interfere with the interaction between the
binding partners, e.g., by competition, can be
identified by conducting the reaction in the presence
of the test substance; i.e., by adding the test
substance to the reaction mixture prior to or
simultaneously with the binding partners. On the
other hand, test compounds that disrupt preformed
complexes, e.g., compounds with higher binding
constants that displace one of the binding partners
from the complex, can be tested by adding the test
compound to the reaction mixture after complexes have
been formed. The various formats are described
briefly below.

In a heterogeneous assay system, one binding 
partner, e.g., either the DP107 or DP178 peptide, is
anchored onto a solid surface, and its binding
partner, which is not anchored, is labeled, either
directly or indirectly (e.g., with a radioactive label
such as 125I, an absorption label such as biotin, or a
fluorescent label such as fluorescein or rhodamine).
In practice, microtiter plates are conveniently
utilized. The anchored species may be immobilized by
non-covalent or covalent attachments. Non-covalent
attachment may be accomplished simply by coating the
solid surface with a solution of the protein and
drying. Alternatively, an immobilized antibody
specific for the protein may be used to anchor the
protein to the solid surface. The surfaces may be
prepared in advance and stored.

In order to conduct the assay, the binding 
partner of the immobilized species is added to the 
coated surface with or without the test compound. 
After the reaction is complete, unreacted components
are removed (e.g., by washing) and any complexes
formed will remain immobilized on the solid surface.
The detection of complexes anchored on the solid
surface can be accomplished in a number of ways.
Where the binding partner was pre-labeled, the
detection of label immobilized on the surface
indicates that complexes were formed. Where the
binding partner is not pre-labeled, an indirect label
can be used to detect complexes anchored on the
surface; e.g., using a labeled antibody specific for
the binding partner (the antibody, in turn, may be
directly labeled or indirectly labeled with a labeled
anti-Ig antibody). Depending upon the order of
addition of reaction components, test compounds which
inhibit complex formation or which disrupt preformed 
complexes can be detected. 
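
As a non-limiting illustration, the order-of-addition logic described above may be sketched as follows. The sketch assumes two hypothetical protocols per test compound, one in which the compound is added before or together with the binding partners and one in which it is added after complexes have formed, and a hypothetical cut-off of 50% of the control signal; neither the cut-off nor the function names are taken from this specification.

```python
def fraction_of_control(signal, control_signal, background=0.0):
    """Complex signal in a test well as a fraction of the no-compound control."""
    return (signal - background) / (control_signal - background)


def classify(pre_added_signal, post_added_signal, control_signal, threshold=0.5):
    """Label a compound by which order-of-addition protocol reduced the complex signal.

    pre_added_signal:  compound added before or with the binding partners
    post_added_signal: compound added after complexes had formed
    threshold:         hypothetical cut-off, expressed as a fraction of control
    """
    blocks = fraction_of_control(pre_added_signal, control_signal) < threshold
    disrupts = fraction_of_control(post_added_signal, control_signal) < threshold
    if blocks and disrupts:
        return "inhibits formation and disrupts preformed complexes"
    if blocks:
        return "inhibits formation only"
    if disrupts:
        return "disrupts preformed complexes only"
    return "no effect at this threshold"
```

For example, `classify(200, 900, 1000)` labels a compound that lowers the signal only when present during complex formation.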

Alternatively, the reaction can be conducted in a 
liquid phase in the presence or absence of the test 
compound, the reaction products separated from 
unreacted components, and complexes detected; e.g.,
using an immobilized antibody specific for one binding
partner to anchor any complexes formed in solution,
and a labeled antibody specific for the other binding
partner to detect anchored complexes. Again,
depending upon the order of addition of reactants to
the liquid phase, test compounds which inhibit complex
formation or which disrupt preformed complexes can be
identified. 

In an alternate embodiment of the invention, a
homogeneous assay can be used. In this approach, a
preformed complex of the DP107 and DP178 peptides is
prepared in which one of the binding partners is
labeled, but the signal generated by the label is
quenched due to complex formation (see, e.g., U.S.
Patent No. 4,109,496 by Rubenstein which utilizes this
approach for immunoassays). The addition of a test
substance that competes with and displaces one of the
binding partners from the preformed complex will
result in the generation of a signal above background.
In this way, test substances which disrupt
DP107/DP178 protein-protein interaction can be identified.

In still another embodiment of the invention,
fluorescence polarization may be used in a homogeneous
assay. In this approach, complex formation is
detected by measuring the polarization of a
fluorescently labeled peptide (e.g., with fluorescein
or rhodamine) in a sample. Binding of the peptide to
its complementary HR1 or HR2 binding domain in a
larger molecular weight peptide or protein, such as in
a maltose binding fusion protein described herein,
alters the correlation time of the fluorescent moiety
and thereby increases the fluorescence polarization of
the labeled peptide.
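
For illustration, polarization may be computed from the emission intensities measured parallel and perpendicular to the plane of the polarized excitation light, using the standard definitions of polarization and anisotropy. The sketch below is not taken from this specification; the instrument correction factor (G factor) defaults to 1 as an assumption, and the intensity values used in testing are hypothetical.

```python
def polarization(i_parallel, i_perpendicular, g_factor=1.0):
    """Fluorescence polarization P = (I_par - G*I_perp) / (I_par + G*I_perp)."""
    denominator = i_parallel + g_factor * i_perpendicular
    if denominator == 0:
        raise ValueError("no fluorescence signal")
    return (i_parallel - g_factor * i_perpendicular) / denominator


def anisotropy(i_parallel, i_perpendicular, g_factor=1.0):
    """Fluorescence anisotropy r = (I_par - G*I_perp) / (I_par + 2*G*I_perp)."""
    denominator = i_parallel + 2.0 * g_factor * i_perpendicular
    if denominator == 0:
        raise ValueError("no fluorescence signal")
    return (i_parallel - g_factor * i_perpendicular) / denominator
```

Comparing the value obtained for the labeled peptide alone with the value obtained in the presence of its binding partner, with and without a test compound, gives the readout for the homogeneous assay.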

In an alternative screening assay, test compounds 
may be assayed for their ability to disrupt a
DP178/DP107 interaction, as measured immunometrically
using an antibody specifically reactive to a
DP107/DP178 complex (i.e., an antibody that recognizes
neither DP107 nor DP178 individually). Such an assay
acts as a competition assay, and is based on 
techniques well known to those of skill in the art. 

The above competition assay may be described, by
way of example, and not by way of limitation, by using
the DP178 and M41Δ178 peptides and by assaying test
compounds for the disruption of the complexes formed
by these two peptides by immunometrically visualizing
DP178/M41Δ178 complexes via the human recombinant Fab,
Fab-d, as described, below, in the Example presented
in Section 8. M41Δ178 is a maltose binding fusion
protein containing a gp41 region having its DP178
domain deleted, and is described, below, in the
Example presented in Section 8.

Utilizing such an assay, M41Δ178 may be
immobilized onto solid supports such as microtiter
wells. A series of dilutions of a test compound may
then be added to each M41Δ178-containing well in the
presence of a constant concentration of DP178
peptide. After incubation at, for example, room
temperature for one hour, unbound DP178 and test
compound are removed from the wells and wells are then
incubated with the DP178/M41Δ178-specific Fab-d
antibody. After incubation and washing, unbound Fab-d
is removed from the plates and bound Fab-d is
quantitated. A no-inhibitor control should also be
conducted. Test compounds showing an ability to
disrupt DP178/M41Δ178 complex formation are identified
by their concentration-dependent decrease in the level
of Fab-d binding.
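
By way of example only, the concentration-dependent decrease in Fab-d binding across a dilution series may be summarized as the test compound concentration that reduces bound Fab-d to 50% of the no-inhibitor control. The sketch below estimates that concentration by linear interpolation on a log10 concentration scale; this interpolation is an illustrative choice, not a method stated in this specification, and the function name and inputs are hypothetical.

```python
import math


def ic50_by_interpolation(concentrations, bound_signal, control_signal):
    """Estimate the concentration giving 50% of the no-inhibitor control signal.

    Interpolates linearly in log10(concentration) between the two dilutions
    that bracket the 50% level. Returns None if the series never crosses 50%.
    """
    half = 0.5 * control_signal
    points = sorted(zip(concentrations, bound_signal))
    for (c_lo, s_lo), (c_hi, s_hi) in zip(points, points[1:]):
        if s_lo >= half >= s_hi and s_lo != s_hi:
            fraction = (s_lo - half) / (s_lo - s_hi)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None
```

A compound whose dilution series never reaches the 50% level returns `None` and would need to be retested over a wider concentration range.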

A variation of such an assay may be utilized to
perform a rapid, high-throughput binding assay which
is capable of directly measuring DP178 binding to
M41Δ178 for the determination of binding constants of
the ligand or inhibitory constants for competitors of
DP178 binding.
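
As an illustration of how an inhibitory constant might be derived from such a competition measurement, the widely used Cheng-Prusoff relationship for competitive inhibitors, Ki = IC50 / (1 + [L]/Kd), may be applied. This relationship is not recited in this specification and is offered only as an assumption about one conventional analysis; all argument names and the values used in testing are hypothetical.

```python
def cheng_prusoff_ki(ic50, ligand_concentration, ligand_kd):
    """Inhibitory constant of a competitor: Ki = IC50 / (1 + [L] / Kd).

    ic50, ligand_concentration and ligand_kd must share the same units
    (for example, nM). Assumes simple competitive binding at a single site.
    """
    if ligand_kd <= 0:
        raise ValueError("ligand Kd must be positive")
    return ic50 / (1.0 + ligand_concentration / ligand_kd)
```

Because the correction depends on the labeled ligand concentration, the same competitor will show a higher IC50 when more labeled DP178 is used, while the derived Ki should remain comparable.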

Such an assay takes advantage of accepted
radioligand and receptor binding principles. (See,
for example, Yamamura, H.I. et al., 1985,
"Neurotransmitter Receptor Binding", 2nd ed., Raven
Press, NY.) As above, M41Δ178 is immobilized onto a
solid support such as a microtiter well. DP178
binding to M41Δ178 is then quantitated by measuring
the fraction of DP178 that is bound as 125I-DP178 and
calculating the total amount bound using a value for
specific activity (dpm/µg peptide) determined for each
labeled DP178 preparation. Specific binding to
M41Δ178 is defined as the difference of the binding of
the labeled DP178 preparation in the microtiter wells
(totals) and the binding in identical wells
containing, in addition, excess unlabeled DP178
(nonspecifics).
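
The arithmetic described above may be sketched as follows, purely for illustration. Counts from the "totals" wells and the "nonspecifics" wells are subtracted, and the difference is converted to mass of peptide using the specific activity of the labeled preparation. The function names and the numerical values used in testing are hypothetical.

```python
def bound_mass_ug(dpm, specific_activity_dpm_per_ug):
    """Convert measured radioactivity (dpm) to micrograms of labeled peptide."""
    if specific_activity_dpm_per_ug <= 0:
        raise ValueError("specific activity must be positive")
    return dpm / specific_activity_dpm_per_ug


def specific_binding_ug(total_dpm, nonspecific_dpm, specific_activity_dpm_per_ug):
    """Specific binding = totals minus nonspecifics, expressed in micrograms."""
    return bound_mass_ug(total_dpm - nonspecific_dpm, specific_activity_dpm_per_ug)
```

The specific activity must be the value determined for the particular labeled DP178 preparation used in the wells being compared.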

Because the binding affinity for native DP178 and
DP107 is very high (including native DP178-like and
DP107-like peptides from other species; e.g., 10 nM
for DP178 in HIV-1, and 2 nM for T112 in RSV), test
compounds must exhibit high binding affinity to
interfere with or disrupt the DP178/DP107 binding
interaction. Accordingly, in another non-limiting
example of the above-described competition assays,
such assays can be performed using "modified" DP107
and/or DP178 peptides (e.g., DP107 and/or DP178
analogs) which have reduced binding affinities
relative to the unmodified "parent peptides". The
use of such modified DP107 and DP178 peptides greatly
increases the sensitivity of the competition assays of
the invention by identifying more compounds with
inhibitory potential. The binding affinities of
compounds identified in the assays can then be
optimized, e.g., using standard medicinal chemistry
techniques, to generate compounds that are more
powerful inhibitors of DP107/DP178 complex formation
and are therefore useful, e.g., as antiviral reagents.
Alternatively, compounds identified in the competition
assays using DP107 and/or DP178 analogs with reduced
binding affinities may, themselves, be useful, e.g.,
as antiviral reagents.

The term "reduced affinity," as used herein,
refers to a DP107, DP178, DP107-like or DP178-like 
peptide that interacts with and forms a DP107/DP178 
peptide pair, a HR1/DP178 pair or an HR2/DP107 pair 
under competition assay conditions, but interacts with 
5 its "partner" to form such a pair with a lower . 

affinity than would a DP107 or DP178 "parent" peptide 
from which the reduced affinity peptide is derived. 

Generally, the binding affinity of a peptide can
be expressed as a B50 value, i.e., the concentration of
peptide necessary for 50% of the peptide molecules to
bind to their target under a given set of conditions.
Preferably, the B50 value of a reduced affinity peptide
will be at least twice, and more preferably at least
five times, at least 10 times, at least 20 times, or
at least 100 times the B50 value of the unmodified
peptide from which it was derived.
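
The fold criteria above amount to a simple ratio of B50 values, as
the following sketch shows. The function names are hypothetical.
The example values of 1 nM and 5 nM are the binding affinities
reported in this section for the full-length and truncated RSV
peptide T112; treating them as B50 values is an assumption made
only for illustration.

```python
def b50_fold_increase(b50_modified_nm, b50_parent_nm):
    """Return how many times higher the modified peptide's B50 is."""
    if b50_modified_nm <= 0 or b50_parent_nm <= 0:
        raise ValueError("B50 values must be positive")
    return b50_modified_nm / b50_parent_nm


def highest_threshold_met(fold, thresholds=(2, 5, 10, 20, 100)):
    """Return the largest fold threshold from the text that is met, or None."""
    met = [t for t in thresholds if fold >= t]
    return max(met) if met else None


# Assumed illustration: parent at 1 nM, truncated analog at 5 nM.
fold = b50_fold_increase(5.0, 1.0)
print(fold, highest_threshold_met(fold))
```

A higher B50 corresponds to weaker binding, so a fold value above
1 indicates a reduced affinity peptide in the sense used here.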

Modified DP107 and DP178 peptides that have
reduced binding affinities may be generated according
to any number of techniques that will be readily
apparent to those skilled in the art. For example, in
one embodiment modified DP107 and DP178 peptides with
reduced binding affinities may be generated by
generating truncated DP107 and DP178 peptides,
respectively. Such peptides may be routinely
synthesized and tested, e.g., by the above described
screening assays, to determine their binding
affinities to their target. For example, as described
in the example presented below in Section 30, reducing
the length of the native RSV DP178-like peptide T112
from 35 to 28 amino acid residues resulted in a
five-fold drop in binding affinity (from 1 nM to 5 nM).
Generally, such truncation can be of 1, 2, 3, 4, 5, 6,
7, 8, 9 or 10 amino acid residues.

Alternatively, modified DP107 and DP178 peptides
with reduced binding affinity may be identified and
generated by identifying and substituting, inserting
or deleting amino acid residues. For example, in one
embodiment, which is also demonstrated in the example
presented below in Section 30, modified DP107 and/or
DP178 peptides may be routinely synthesized and
assayed for reduced binding affinity by systematically
replacing one or more amino acid residues of the
native DP107 or DP178 peptide with other amino acid
residues and testing the binding affinity of the
resulting peptide by techniques such as those
described herein. Preferably, the substituted amino
acid residues are neutral amino acid residues
exhibiting relatively small side chains, such as
alanine or glycine.
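
As an illustration of the systematic substitution described above,
the sketch below enumerates every single-residue alanine
replacement of a parent sequence. The function name and the
8-residue parent sequence are hypothetical and are not taken from
the sequence listing; one-letter amino acid codes are assumed.

```python
def single_substitution_scan(sequence, replacement="A"):
    """List (name, variant) pairs, one per position replaced by `replacement`.

    Positions that already carry the replacement residue are skipped,
    because substituting them would reproduce the parent peptide.
    """
    variants = []
    for i, residue in enumerate(sequence):
        if residue == replacement:
            continue
        # Conventional mutation name: original residue, 1-based position, new residue.
        name = f"{residue}{i + 1}{replacement}"
        variants.append((name, sequence[:i] + replacement + sequence[i + 1:]))
    return variants


# Arbitrary illustrative parent peptide (an assumption, not DP107 or DP178).
parent = "LKDEWNSF"
for name, variant in single_substitution_scan(parent):
    print(name, variant)
```

Passing `replacement="G"` would give the corresponding glycine
scan. Each variant would then be synthesized and assayed for
binding affinity as described in the text.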

Such substitutions can identify "key" amino acid
residues and can be used in the competition assays of
the invention. Alternatively, upon identification of
key residues by such systematic substitutions, the key
residues can be changed to other residues and the
resulting, modified peptides can be tested for binding
affinity.

Modified DP107 and/or DP178 peptides that have
reduced binding affinities may still further be
identified using principles of protein chemistry and
design that are well known to those of skill in the
art. Specifically, such principles may be used to
identify those amino acid residues of a native DP107
or DP178 sequence that affect, e.g., solubility,
binding affinity, or stability of the peptide. Thus,
for example, using known principles of amino acid
chemistry and protein design one skilled in the art
could identify amino acid residues in a native DP107
or DP178 peptide that affect the structure of the
peptide.

5.7 PHARMACEUTICAL FORMULATIONS, DOSAGES 
AND MODES OF ADMINISTRATION 

The peptides of the invention may be administered
using techniques well known to those in the art.
Preferably, agents are formulated and administered
systemically. Techniques for formulation and
administration may be found in "Remington's
Pharmaceutical Sciences", 18th ed., 1990, Mack
Publishing Co., Easton, PA. Suitable routes may
include oral, rectal, transmucosal, or intestinal
administration; parenteral delivery, including
intramuscular, subcutaneous, and intramedullary
injections, as well as intrathecal, direct
intraventricular, intravenous, intraperitoneal,
intranasal, or intraocular injections, just to name a
few. For injection, the agents of the invention may
be formulated in aqueous solutions, preferably in
physiologically compatible buffers such as Hanks'
solution, Ringer's solution, or physiological saline
buffer. For transmucosal administration,
penetrants appropriate to the barrier to be permeated
are used in the formulation. Such penetrants are
generally known in the art.

In instances wherein intracellular administration
of the peptides of the invention or other inhibitory
agents is preferred, techniques well known to those of
ordinary skill in the art may be utilized. For
example, such agents may be encapsulated into
liposomes, then administered as described above.
Liposomes are spherical lipid bilayers with aqueous
interiors. All molecules present in an aqueous
solution at the time of liposome formation are
incorporated into the aqueous interior. The liposomal
contents are both protected from the external
microenvironment and, because liposomes fuse with cell
membranes, are effectively delivered into the cell
cytoplasm. Additionally, when small molecules are to
be administered, direct intracellular administration
may be achieved due to their hydrophobicity.

Nucleotide sequences encoding the peptides of the
invention which are to be intracellularly administered
may be expressed in cells of interest, using
techniques well known to those of skill in the art.
For example, expression vectors derived from viruses
such as retroviruses, vaccinia viruses, adeno-associated
viruses, herpes viruses, or bovine
papilloma viruses, may be used for delivery and
expression of such nucleotide sequences into the
targeted cell population. Methods for the
construction of such vectors and expression constructs
are well known. See, for example, Sambrook et al.,
1989, Molecular Cloning, A Laboratory Manual, Cold
Spring Harbor Press, Cold Spring Harbor NY, and
Ausubel et al., 1989, Current Protocols in Molecular
Biology, Greene Publishing Associates and Wiley
Interscience, NY.

With respect to HIV, peptides of the invention,
particularly DP107 and DP178, may be used as
therapeutics in the treatment of AIDS. In addition,
the peptides may be used as prophylactic measures in
previously uninfected individuals after acute exposure
to an HIV virus. Examples of such prophylactic use of
the peptides may include, but are not limited to,
prevention of virus transmission from mother to infant
and other settings where the likelihood of HIV
transmission exists, such as, for example, accidents
in health care settings wherein workers are exposed to
HIV-containing blood products. The successful use of
such treatments does not rely upon the generation of a
host immune response directed against such peptides.

Effective dosages of the peptides of the
invention to be administered may be determined through
procedures well known to those in the art which
address such parameters as biological half-life,
bioavailability, and toxicity. Given the data
presented below in Section 6, DP178, for example, may
prove efficacious in vivo at doses required to achieve
circulating levels of about 1 to about 10 ng per ml of
peptide.

A therapeutically effective dose refers to that
amount of the compound sufficient to result in
amelioration of symptoms or a prolongation of survival
in a patient. Toxicity and therapeutic efficacy of
such compounds can be determined by standard
pharmaceutical procedures in cell cultures or
experimental animals, e.g., for determining the LD50
(the dose lethal to 50% of the population) and the
ED50 (the dose therapeutically effective in 50% of the
population). The dose ratio between toxic and
therapeutic effects is the therapeutic index and it
can be expressed as the ratio LD50/ED50. Compounds
which exhibit large therapeutic indices are preferred.
The data obtained from these cell culture assays and
animal studies can be used in formulating a range of
dosage for use in humans. The dosage of such
compounds lies preferably within a range of
circulating concentrations that include the ED50 with
little or no toxicity. The dosage may vary within
this range depending upon the dosage form employed and
the route of administration utilized. For any
compound used in the method of the invention, the
therapeutically effective dose can be estimated
initially from cell culture assays. A dose may be
formulated in animal models to achieve a circulating
plasma concentration range that includes the IC50
(e.g., the concentration of the test compound which
achieves a half-maximal inhibition of the fusogenic
event, such as a half-maximal inhibition of viral
infection relative to the amount of the event in the
absence of the test compound) as determined in cell
culture. Such information can be used to more
accurately determine useful doses in humans. Levels
in plasma may be measured, for example, by high
performance liquid chromatography (HPLC).
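
The therapeutic index defined above is a plain ratio, as the
following sketch illustrates. The function name and the dose
values are hypothetical and are not data from this specification;
both doses are assumed to be expressed in the same units.

```python
def therapeutic_index(ld50, ed50):
    """Return LD50/ED50 for doses given in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses, in mg/kg, for two candidate compounds.
index_a = therapeutic_index(500.0, 5.0)
index_b = therapeutic_index(500.0, 100.0)
print(index_a, index_b)
```

Under the preference stated in the text, the first hypothetical
compound, with the larger index, would be favored over the second.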

The peptides of the invention may, further, serve
the role of a prophylactic vaccine, wherein the host
raises antibodies against the peptides of the
invention, which then serve to neutralize HIV viruses
by, for example, inhibiting further HIV infection.

Administration of the peptides of the invention
as a prophylactic vaccine, therefore, would comprise
administering to a host a concentration of peptides
effective in raising an immune response which is
sufficient to neutralize HIV, by, for example,
inhibiting the ability of HIV to infect cells. The exact
concentration will depend upon the specific peptide to
be administered, but may be determined by using
standard techniques for assaying the development of an
immune response which are well known to those of
ordinary skill in the art. The peptides to be used as
vaccines are usually administered intramuscularly.

The peptides may be formulated with a suitable
adjuvant in order to enhance the immunological
response. Such adjuvants may include, but are not
limited to, mineral gels such as aluminum hydroxide;
surface active substances such as lysolecithin,
pluronic polyols, polyanions; other peptides; oil
emulsions; and potentially useful human adjuvants such
as BCG and Corynebacterium parvum. Many methods may
be used to introduce the vaccine formulations
described here. These methods include but are not
limited to oral, intradermal, intramuscular,
intraperitoneal, intravenous, subcutaneous, and
intranasal routes.

Alternatively, an effective concentration of
polyclonal or monoclonal antibodies raised against the
peptides of the invention may be administered to a
host so that no uninfected cells become infected by
HIV. The exact concentration of such antibodies will
vary according to each specific antibody preparation,
but may be determined using standard techniques well
known to those of ordinary skill in the art.
Administration of the antibodies may be accomplished
using a variety of techniques, including, but not
limited to those described in this section.

For all such treatments described above, the
exact formulation, route of administration and dosage
can be chosen by the individual physician in view of
the patient's condition. (See, e.g., Fingl et al.,
1975, in "The Pharmacological Basis of Therapeutics",
Ch. 1, p. 1).

It should be noted that the attending physician
would know how to and when to terminate, interrupt, or
adjust administration due to toxicity, or to organ
dysfunctions. Conversely, the attending physician
would also know to adjust treatment to higher levels
if the clinical response were not adequate (precluding
toxicity). The magnitude of an administered dose in
the management of the oncogenic disorder of interest
will vary with the severity of the condition to be
treated and the route of administration. The dose, and
perhaps dose frequency, will also vary according to
the age, body weight, and response of the individual
patient. A program comparable to that discussed above
may be used in veterinary medicine.

Use of pharmaceutically acceptable carriers to
formulate the compounds herein disclosed for the
practice of the invention into dosages suitable for
systemic administration is within the scope of the
invention. With proper choice of carrier and suitable
manufacturing practice, the compositions of the
present invention, in particular, those formulated as
solutions, may be administered parenterally, such as
by intravenous injection. The compounds can be
formulated readily using pharmaceutically acceptable
carriers well known in the art into dosages suitable
for oral administration. Such carriers enable the
compounds of the invention to be formulated as
tablets, pills, capsules, liquids, gels, syrups,
slurries, suspensions and the like, for oral ingestion
by a patient to be treated.

Pharmaceutical compositions suitable for use in
the present invention include compositions wherein the
active ingredients are contained in an effective
amount to achieve their intended purpose. Determination
of the effective amounts is well within the capability
of those skilled in the art, especially in light of
the detailed disclosure provided herein.

In addition to the active ingredients, these
pharmaceutical compositions may contain suitable
pharmaceutically acceptable carriers comprising
excipients and auxiliaries which facilitate processing
of the active compounds into preparations which can be
used pharmaceutically. The preparations formulated
for oral administration may be in the form of tablets,
dragees, capsules, or solutions.

The pharmaceutical compositions of the present
invention may be manufactured in a manner that is
itself known, e.g., by means of conventional mixing,
dissolving, granulating, dragee-making, levigating,
emulsifying, encapsulating, entrapping or lyophilizing
processes.

Pharmaceutical formulations for parenteral
administration include aqueous solutions of the active
compounds in water-soluble form. Additionally,
suspensions of the active compounds may be prepared as
appropriate oily injection suspensions. Suitable
lipophilic solvents or vehicles include fatty oils
such as sesame oil, or synthetic fatty acid esters,
such as ethyl oleate or triglycerides, or liposomes.
Aqueous injection suspensions may contain substances
which increase the viscosity of the suspension, such
as sodium carboxymethyl cellulose, sorbitol, or
dextran. Optionally, the suspension may also contain
suitable stabilizers or agents which increase the
solubility of the compounds to allow for the
preparation of highly concentrated solutions.

Pharmaceutical preparations for oral use can be
obtained by combining the active compounds with solid
excipient, optionally grinding a resulting mixture,
and processing the mixture of granules, after adding
suitable auxiliaries, if desired, to obtain tablets or
dragee cores. Suitable excipients are, in particular,
fillers such as sugars, including lactose, sucrose,
mannitol, or sorbitol; cellulose preparations such as,
for example, maize starch, wheat starch, rice starch,
potato starch, gelatin, gum tragacanth, methyl
cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, and/or polyvinylpyrrolidone
(PVP). If desired, disintegrating agents may be
added, such as the cross-linked polyvinyl pyrrolidone,
agar, or alginic acid or a salt thereof such as sodium
alginate.

Dragee cores are provided with suitable coatings.
For this purpose, concentrated sugar solutions may be
used, which may optionally contain gum arabic, talc,
polyvinyl pyrrolidone, carbopol gel, polyethylene
glycol, and/or titanium dioxide, lacquer solutions,
and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or
dragee coatings for identification or to characterize
different combinations of active compound doses.

Pharmaceutical preparations which can be used
orally include push-fit capsules made of gelatin, as
well as soft, sealed capsules made of gelatin and a
plasticizer, such as glycerol or sorbitol. The
push-fit capsules can contain the active ingredients
in admixture with filler such as lactose, binders such
as starches, and/or lubricants such as talc or
magnesium stearate and, optionally, stabilizers. In
soft capsules, the active compounds may be dissolved
or suspended in suitable liquids, such as fatty oils,
liquid paraffin, or liquid polyethylene glycols. In
addition, stabilizers may be added.

6. EXAMPLE: DP178 (SEQ ID:1) IS A POTENT
INHIBITOR OF HIV-1 INFECTION

In this example, DP178 (SEQ ID:1) is shown to be
a potent inhibitor of HIV-1 mediated CD-4+ cell-cell
fusion and infection by cell free virus. In the
fusion assay, this peptide completely blocks virus
induced syncytia formation at concentrations of from
1-10 ng/ml. In the infectivity assay the inhibitory
concentration is somewhat higher, blocking infection
at 90 ng/ml. It is further shown that the antiviral
activity of DP178 (SEQ ID:1) is highly specific for
HIV-1. Additionally, a synthetic peptide, DP-185
(SEQ ID:3), representing an HIV-1-derived DP178
homolog, is also found to block HIV-1-mediated
syncytia formation.



6.1. MATERIALS AND METHODS

6.1.1. PEPTIDE SYNTHESIS
Peptides were synthesized using Fast Moc
chemistry on an Applied Biosystems Model 431A peptide
synthesizer. Generally, unless otherwise noted, the
peptides contained amidated carboxy termini and
acetylated amino termini. Amidated peptides were
prepared using Rink resin (Advanced Chemtech) while
peptides containing free carboxy termini were
synthesized on Wang (p-alkoxy-benzyl-alcohol) resin
(Bachem). First residues were double coupled to the
appropriate resin and subsequent residues were single
coupled. Each coupling step was followed by acetic
anhydride capping. Peptides were cleaved from the
resin by treatment with trifluoroacetic acid (TFA)
(10 ml), H2O (0.5 ml), thioanisole (0.5 ml), ethanedithiol
(0.25 ml), and crystalline phenol (0.75 g). Purification
was carried out by reverse phase HPLC.
Approximately 50 mg samples of crude peptide were
chromatographed on a Waters Delta Pak C18 column (19 mm
x 30 cm, 15 µ spherical) with a linear gradient;
H2O/acetonitrile 0.1% TFA. Lyophilized peptides were
stored desiccated and peptide solutions were made in
water at about 1 mg/ml. Electrospray mass spectrometry
yielded the following results: DP178 (SEQ
ID:1): 4491.87 (calculated 4491.94); DP-180 (SEQ
ID:2): 4491.45 (calculated 4491.94); DP-185 (SEQ
ID:3): not done (calculated 4546.97).

6.1.2. VIRUS
The HIV-1LAI virus was obtained from R. Gallo
(Popovic, M. et al., 1984, Science 224:497-508) and
propagated in CEM cells cultured in RPMI 1640
containing 10% fetal calf serum. Supernatant from the
infected CEM cells was passed through a 0.2 µm filter
and the infectious titer estimated in a
microinfectivity assay using the AA5 cell line to
support virus replication. For this purpose, 25 µl of
serially diluted virus was added to 75 µl AA5 cells at a
concentration of 2 x 10^5/ml in a 96-well microtitre
plate. Each virus dilution was tested in triplicate.
Cells were cultured for eight days by addition of
fresh medium every other day. On day 8 post
infection, supernatant samples were tested for virus
replication as evidenced by reverse transcriptase
activity released to the supernatant. The TCID50 was
calculated according to the Reed and Muench formula
(Reed, L.J. et al., 1938, Am. J. Hyg. 27:493-497).
The titer of the HIV-1LAI and HIV-1MN stocks used for
these studies, as measured on the AA5 cell line, was
approximately 1.4 x 10^6 and 3.8 x 10^4 TCID50/ml,
respectively.
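
The Reed and Muench endpoint calculation cited above can be
sketched as follows. This is a minimal illustration and not the
procedure actually used to obtain the reported titers. The
triplicate wells and the 25 µl inoculum are taken from the text;
the function names, the tenfold dilution series and the pattern of
infected wells are hypothetical.

```python
def reed_muench_log10_endpoint(log10_dilutions, infected, total):
    """Return the log10 dilution at which 50% of wells are infected.

    log10_dilutions must run from most concentrated to most dilute,
    e.g. [-1, -2, -3]. infected and total give wells per dilution.
    """
    n = len(log10_dilutions)
    uninfected = [t - i for i, t in zip(infected, total)]
    # Infected wells accumulate from the most dilute sample upward.
    cum_infected = [sum(infected[k:]) for k in range(n)]
    # Uninfected wells accumulate from the most concentrated sample downward.
    cum_uninfected = [sum(uninfected[:k + 1]) for k in range(n)]
    percent = [100.0 * a / (a + b) for a, b in zip(cum_infected, cum_uninfected)]
    for k in range(n - 1):
        if percent[k] >= 50.0 > percent[k + 1]:
            # Proportionate distance between the two bracketing dilutions.
            distance = (percent[k] - 50.0) / (percent[k] - percent[k + 1])
            step = log10_dilutions[k + 1] - log10_dilutions[k]
            return log10_dilutions[k] + distance * step
    raise ValueError("50% endpoint is not bracketed by the dilution series")


def tcid50_per_ml(log10_endpoint, inoculum_ml):
    """Convert an endpoint dilution into TCID50 per ml of undiluted stock."""
    return 10 ** (-log10_endpoint) / inoculum_ml


# Hypothetical triplicate results for tenfold dilutions 10^-1 to 10^-5.
endpoint = reed_muench_log10_endpoint(
    [-1, -2, -3, -4, -5], [3, 3, 2, 1, 0], [3, 3, 3, 3, 3]
)
titer = tcid50_per_ml(endpoint, 0.025)  # 25 µl inoculum per well
print(endpoint, round(titer))
```

The cumulative sums are what distinguish this method from a simple
interpolation of per-dilution percentages: each well is counted as
if it would also have responded the same way at every more
favorable dilution.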

6.1.3. CELL FUSION ASSAY

Approximately 7 x 10^4 Molt cells were incubated
with 1 x 10^4 CEM cells chronically infected with the
HIV-1LAI virus in 96-well plates (one-half area cluster
plates; Costar, Cambridge, MA) in a final volume of
100 µl culture medium as previously described
(Matthews, T.J. et al., 1987, Proc. Natl. Acad. Sci.
USA 84:5424-5428). Peptide inhibitors were added in
a volume of 10 µl and the cell mixtures were incubated
for 24 hr. at 37°C. At that time, multinucleated
giant cells were estimated by microscopic examination
at a 40x magnification which allowed visualization of
the entire well in a single field.

6.1.4. CELL FREE VIRUS INFECTION ASSAY
Synthetic peptides were incubated at 37°C with
either 247 TCID50 (for experiment depicted in FIG. 2),
or 62 TCID50 (for experiment depicted in FIG. 3) units
of HIV-1LAI virus or 25 TCID50 units of HIV-2NIHZ and CEM
CD4+ cells at peptide concentrations of 0, 0.04, 0.4,
4.0, and 40 µg/ml for 7 days. The resulting reverse
transcriptase (RT) activity in counts per minute was
determined using the assay described, below, in
Section 6.1.5. See, Reed, L.J. et al., 1938, Am. J.
Hyg. 27:493-497 for an explanation of TCID50
calculations.

6.1.5. REVERSE TRANSCRIPTASE ASSAY
The micro-reverse transcriptase (RT) assay was
adapted from Goff et al. (Goff, S. et al., 1981, J.
Virol. 38:239-248) and Willey et al. (Willey, R. et
al., 1988, J. Virol. 62:139-147). Supernatants from
virus/cell cultures were adjusted to 1% Triton-X100. A
10 µl sample of supernatant was added to 50 µl of RT
cocktail in a 96-well U-bottom microtitre plate and
the samples incubated at 37°C for 90 min. The RT
cocktail contained 75 mM KCl, 2 mM dithiothreitol, 5 mM
MgCl2, 5 µg/ml poly A (Pharmacia, cat. No. 27-4110-01),
0.25 units/ml oligo dT (Pharmacia, cat. No. 27-7858-01),
0.05% NP40, 50 mM Tris-HCl, pH 7.8, 0.5 µM
non-radioactive dTTP, and 10 µCi/ml 32P-dTTP (Amersham, cat.
No. PB.10167).

After the incubation period, 40 µl of reaction
mixture was applied to a Schleicher and Schuell (S+S)
NA45 membrane (or DE81 paper) saturated in 2 x SSC
buffer (0.3M NaCl and 0.003M sodium citrate) held in a
S+S Minifold over one sheet of GB003 (S+S) filter
paper, with partial vacuum applied. Each well of the
minifold was washed four times with 200 µl 2xSSC, under
full vacuum. The membrane was removed from the
minifold and washed 2 more times in a pyrex dish with
an excess of 2xSSC. Finally, the membrane was drained
on absorbent paper, placed on Whatman #3 paper,
covered with Saran wrap, and exposed to film overnight
at -70°C.

6.2. RESULTS
6.2.1. PEPTIDE INHIBITION OF INFECTED CELL-
INDUCED SYNCYTIA FORMATION



The initial screen for antiviral activity assayed
peptides' ability to block syncytium formation induced
by overnight co-cultivation of uninfected Molt4 cells
with chronically HIV-1 infected CEM cells. The
results of several such experiments are presented
herein. In the first of these experiments, serial
DP178 (SEQ ID:1) peptide concentrations between
10 µg/ml and 12.5 ng/ml were tested for blockade of the
cell fusion process. For these experiments, CEM cells
chronically infected with either HIV-1LAI, HIV-1MN,
HIV-1RF, or HIV-1SF2 virus were cocultivated overnight with
uninfected Molt4 cells. The results (FIG. 4) show
that DP178 (SEQ ID:1) afforded complete protection
against each of the HIV-1 isolates down to the lowest
concentration of DP178 (SEQ ID:1) used. For HIV-1LAI
inhibition, the lowest concentration tested was
12.5 ng/ml; for all other HIV-1 viruses, the lowest
concentration of DP178 (SEQ ID:1) used in this study
was 100 ng/ml. A second peptide, DP-180 (SEQ ID:2),
containing the same amino acid residues as DP178 (SEQ
ID:1) but arranged in a random order, exhibited no
evidence of anti-fusogenic activity even at the high
concentration of 40 µg/ml (FIG. 4). These observations
indicate that the inhibitory effect of DP178 (SEQ
ID:1) is primary sequence-specific and not related to
non-specific peptide/protein interactions. The actual
endpoint (i.e., the lowest effective inhibitory
concentration) of DP178 inhibitory action is within
the range of 1-10 ng/ml.

The next series of experiments involved the
preparation and testing of a DP178 (SEQ ID:1) homolog
for its ability to inhibit HIV-1-induced syncytia
formation. As shown in FIG. 1, the sequence of DP-185
(SEQ ID:3) is slightly different from DP178 (SEQ ID:1)
in that its primary sequence is taken from the HIV-1SF2
isolate and contains several amino acid differences
relative to DP178 (SEQ ID:1) near the N terminus. As
shown in FIG. 4, DP-185 (SEQ ID:3) exhibits
inhibitory activity even at 312.5 ng/ml, the lowest
concentration tested.

The next series of experiments involved a
comparison of DP178 (SEQ ID:1) HIV-1 and HIV-2
inhibitory activity. As shown in FIG. 5, DP178 (SEQ
ID:1) blocked HIV-1-mediated syncytia formation at
peptide concentrations below 1 ng/ml. DP178 (SEQ ID:1)
failed, however, to block HIV-2 mediated syncytia
formation at concentrations as high as 10 µg/ml. This
striking 4 log selectivity of DP178 (SEQ ID:1) as an
inhibitor of HIV-1-mediated cell fusion demonstrates
an unexpected HIV-1 specificity in the action of DP178
(SEQ ID:1). The inhibition by DP178 (SEQ ID:1) of
HIV-1-mediated cell fusion, together with the peptide's
inability to inhibit HIV-2 mediated cell fusion in the
same cell type at the concentrations tested, provides
further evidence for the high degree of selectivity
associated with the antiviral action of DP178 (SEQ ID:1).
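
The "4 log" figure follows from the two concentrations quoted
above, as this short sketch shows. The function name is
hypothetical. Because HIV-1 fusion was blocked below 1 ng/ml and
HIV-2 fusion was not blocked at 10 µg/ml, the value computed here
is best read as a lower bound on the selectivity.

```python
import math


def log10_selectivity(inactive_conc_ng_ml, active_conc_ng_ml):
    """Return the log10 ratio between two concentrations in the same units."""
    if inactive_conc_ng_ml <= 0 or active_conc_ng_ml <= 0:
        raise ValueError("concentrations must be positive")
    return math.log10(inactive_conc_ng_ml / active_conc_ng_ml)


hiv2_highest_tested_ng_ml = 10_000.0  # 10 µg/ml, no block of HIV-2 fusion
hiv1_blocking_ng_ml = 1.0             # HIV-1 fusion blocked below this level
logs = log10_selectivity(hiv2_highest_tested_ng_ml, hiv1_blocking_ng_ml)
print(f"selectivity of at least {logs:.0f} log")
```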

6.2.2. PEPTIDE INHIBITION OF INFECTION BY
CELL-FREE VIRUS

DP178 (SEQ ID:1) was next tested for its ability
to block CD-4+ CEM cell infection by cell free HIV-1
virus. The results, shown in FIG. 2, are from an
experiment in which DP178 (SEQ ID:1) was assayed for
its ability to block infection of CEM cells by an
HIV-1LAI isolate. Included in the experiment were
three control peptides, DP-116 (SEQ ID:9), DP-125 (SEQ
ID:8), and DP-118 (SEQ ID:10). DP-116 (SEQ ID:9)
represents a peptide previously shown to be inactive
using this assay, and DP-125 (SEQ ID:8; Wild, C. et
al., 1992, Proc. Natl. Acad. Sci. USA 89:10,537) and
DP-118 (SEQ ID:10) are peptides which have previously
been shown to be active in this assay. Each
concentration (0, 0.04, 0.4, 4, and 40 µg/ml) of
peptide was incubated with 247 TCID50 units of HIV-1LAI
virus and CEM cells. After 7 days of culture, cell-free
supernatant was tested for the presence of RT
activity as a measure of successful infection. The
results, shown in FIG. 2, demonstrate that DP178 (SEQ
ID:1) inhibited the de novo infection process mediated
by the HIV-1 viral isolate at concentrations as low as
90 ng/ml (IC50=90 ng/ml). In contrast, the two positive
control peptides, DP-125 (SEQ ID:8) and DP-118 (SEQ
ID:10), had over 60-fold higher IC50 concentrations of
approximately 5 µg/ml.
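
One simple way to estimate an IC50 from a dilution series such as
the one described above is log-linear interpolation between the
two concentrations that bracket half of the no-peptide signal. The
sketch below illustrates that approach only; the specification
does not state how the reported IC50 values were derived. The
function name and the RT counts are hypothetical, while the peptide
concentrations are those given in the text.

```python
import math


def ic50_log_interpolation(concentrations, signals, control_signal):
    """Estimate the concentration giving half of the control signal.

    concentrations: ascending, positive peptide concentrations.
    signals: assay signal (e.g. RT counts per minute) at each one.
    control_signal: signal measured with no peptide present.
    """
    half = control_signal / 2.0
    points = list(zip(concentrations, signals))
    for (c_lo, s_lo), (c_hi, s_hi) in zip(points, points[1:]):
        if s_lo >= half > s_hi:
            # Fraction of the way from c_lo to c_hi, on a log scale.
            fraction = (s_lo - half) / (s_lo - s_hi)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    raise ValueError("half-maximal signal is not bracketed by the data")


# Concentrations from the text (µg/ml); RT counts are invented for illustration.
conc_ug_ml = [0.04, 0.4, 4.0, 40.0]
rt_cpm = [9000.0, 6000.0, 4000.0, 500.0]
ic50 = ic50_log_interpolation(conc_ug_ml, rt_cpm, control_signal=10000.0)
print(f"estimated IC50: {ic50:.2f} µg/ml")
```

With only four tenfold-spaced concentrations, an estimate of this
kind is coarse; a fitted dose-response curve would normally be
preferred where enough data points are available.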

In a separate experiment, the HIV-1 and HIV-2
inhibitory action of DP178 (SEQ ID:1) was tested with
CEM cells and either HIV-1LAI or HIV-2NIHZ. 62 TCID50
HIV-1LAI or 25 TCID50 HIV-2NIHZ were used in these
experiments, and were incubated for 7 days. As may be
seen in FIG. 3, DP178 (SEQ ID:1) inhibited HIV-1
infection with an IC50 of about 31 ng/ml. In contrast,
DP178 (SEQ ID:1) exhibited a much higher IC50 for
HIV-2NIHZ, thus making DP178 (SEQ ID:1) two logs more potent
as an HIV-1 inhibitor than an HIV-2 inhibitor. This
finding is consistent with the results of the fusion
inhibition assays described, above, in Section 6.2.1,
and further supports a significant level of
selectivity (i.e., for HIV-1 over HIV-2).

7. EXAMPLE: THE HIV-1 INHIBITOR,
DP178 (SEQ ID:1) IS NON-CYTOTOXIC

In this Example, the 36 amino acid synthetic
peptide inhibitor DP178 (SEQ ID:1) is shown to be
non-cytotoxic to cells in culture, even at the highest
peptide concentrations (40 µg/ml) tested.

7.1. MATERIALS AND METHODS
Cell proliferation and toxicity assay:
Approximately 3.8 x 10^5 CEM cells for each peptide
concentration were incubated for 3 days at 37°C in T25
flasks. Peptides tested were DP178 (SEQ ID:1) and
DP-116 (SEQ ID:9), as described in FIG. 1. Peptides were
synthesized as described, above, in Section 6.1. The
concentrations of each peptide used were 0, 2.5, 10,
and 40 µg/ml. Cell counts were taken at incubation
times of 0, 24, 48, and 72 hours.

7.2. RESULTS
Whether the potent HIV-1 inhibitor DP178 (SEQ
ID:1) exhibited any cytotoxic effects was assessed by
assaying the peptide's effects on the proliferation
and viability of cells in culture. CEM cells were
incubated in the presence of varying concentrations of
DP178 (SEQ ID:1), and DP-116 (SEQ ID:9), a peptide
previously shown to be ineffective as an HIV inhibitor
(Wild, C. et al., 1992, Proc. Natl. Acad. Sci. USA
89:10,537-10,541). Additionally, cells were incubated
in the absence of either peptide.

The results of the cytotoxicity study demonstrate
that DP178 (SEQ ID:1) exhibits no cytotoxic effects on
cells in culture. As can be seen below in Table VI,
even the proliferation and viability characteristics
of cells cultured for 3 days in the presence of the
highest concentration of DP178 (SEQ ID:1) tested
(40 µg/ml) do not significantly differ from the DP-116
(SEQ ID:9) or the no-peptide controls. The cell
proliferation data are also represented in graphic form
in FIG. 6. As was demonstrated in the Working Example
presented above in Section 6, DP178 (SEQ ID:1)
completely inhibits HIV-1 mediated syncytia formation
at peptide concentrations between 1 and 10 ng/ml, and
completely inhibits cell-free viral infection at
concentrations of at least 90 ng/ml. Thus, this study
demonstrates that even at peptide concentrations
greater than 3 log higher than the HIV inhibitory
dose, DP178 (SEQ ID:1) exhibits no cytotoxic effects.
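The size of this margin can be checked with a short calculation. The following is only an illustrative sketch: the function name is hypothetical, and it assumes the comparison is made between the highest concentration tested (40 µg/ml) and the inhibitory concentrations quoted above (10 ng/ml for syncytia formation, 90 ng/ml for cell-free infection).

```python
import math


def log10_fold(tested_ug_per_ml, inhibitory_ng_per_ml):
    """Return log10 of (tested concentration / inhibitory concentration).

    Hypothetical helper: the tested value is given in ug/ml and the
    inhibitory value in ng/ml, matching the units used in the text.
    """
    tested_ng_per_ml = tested_ug_per_ml * 1000.0  # 1 ug = 1000 ng
    return math.log10(tested_ng_per_ml / inhibitory_ng_per_ml)


# Margin relative to the upper end of the syncytia-inhibiting range.
margin_syncytia = log10_fold(40, 10)
# Margin relative to the cell-free infection inhibitory concentration.
margin_infection = log10_fold(40, 90)

print(round(margin_syncytia, 2), round(margin_infection, 2))
```

Under these assumptions the margin is about 3.6 log relative to the syncytia-inhibiting dose and about 2.6 log relative to the cell-free infection dose.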
Table VI

                 Peptide            % Viability at time (hours)
                 Concentration
Peptide          (µg/ml)            0       24      48      72

DP178            40                 98      97      95      97
(SEQ ID:1)       10                 98      97      98      98
                 2.5                98      93      96      96

DP116            40                 98      95      98      97
(SEQ ID:9)       10                 98      95      93      98
                 2.5                98      96      98      99

No Peptide                          98      97      99      98



8. EXAMPLE: THE INTERACTION OF DP178 AND DP107
Soluble recombinant forms of gp41 used in the
example described below provide evidence that the
DP178 peptide associates with a distal site on gp41
whose interactive structure is influenced by the DP107
leucine zipper motif. A single mutation disrupting
the coiled-coil structure of the leucine zipper domain
transformed the soluble recombinant gp41 protein from
an inactive to an active inhibitor of HIV-1 fusion.
This transformation may result from liberation of the
potent DP178 domain from a molecular clasp with the
leucine zipper, DP107, determinant. The results also
indicate that the anti-HIV activity of various gp41
derivatives (peptides and recombinant proteins) may be
due to their ability to form complexes with viral gp41
and interfere with its fusogenic process.

8.1. MATERIALS AND METHODS

8.1.1. CONSTRUCTION OF FUSION PROTEINS
AND GP41 MUTANTS

Construction of fusion proteins and mutants shown
in FIG. 7 was accomplished as follows: the DNA
sequence corresponding to the extracellular domain of
gp41 (540-686) was cloned into the Xmn I site of the
expression vector pMal-p2 (New England Biolabs) to give
M41. The gp41 sequence was amplified from pgtat
(Malim et al., 1988, Nature 355: 181-183) by using
polymerase chain reaction (PCR) with upstream primer
5'-ATGACGCTGACGGTACAGGCC-3' (primer A) and downstream
primer 5'-TGACTAAGCTTAATACCACAGCCAATTTGTTAT-3' (primer
B). M41-P was constructed by using the T7-Gen
in vitro mutagenesis kit from United States
Biochemicals (USB) following the supplier's
instructions. The mutagenic primer
(5'-GGAGCTGCTTGGGGCCCCAGAC-3') introduces an Ile to Pro
mutation in M41 at position 578. M41Δ107, from which
the DP-107 region has been deleted, was made using a
deletion mutagenic primer
5'-CCAAATCCCCAGGAGCTGCTCGAGCTGCACTATACCAGAC-3' (primer C)
following the USB T7-Gen mutagenesis protocol.
M41Δ178, from which the DP-178 region has been
deleted, was made by cloning the DNA fragment
corresponding to gp41 amino acids 540-642 into the
Xmn I site of pMal-p2. Primer A and
5'-ATAGCTTCTAGATTAATTGTTAATTTCTCTGTCCC-3' (primer D) were
used in the PCR with the template pgtat to generate
the inserted DNA fragments. M41-P was used as the
template with primers A and D in PCR to generate
M41-PΔ178. All inserted sequences and mutated residues
were checked by restriction enzyme analysis and
confirmed by DNA sequencing.

8.1.2. PURIFICATION AND CHARACTERIZATION
OF FUSION PROTEINS

The fusion proteins were purified according to
the protocol described in the manufacturer's brochure
of protein fusion and purification systems from New
England Biolabs (NEB). Fusion proteins (10 ng) were
analyzed by electrophoresis on 8% SDS polyacrylamide
gels. Western blotting analysis was performed as
described by Sambrook et al., 1989, Molecular Cloning:
A Laboratory Manual, 2d Ed, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, Ch. 18,
pp. 64-75. An HIV-1 positive serum diluted 1000-fold,
or a human Fab derived from repertoire cloning, was
used to react with the fusion proteins. The second
antibody was HRP-conjugated goat antihuman Fab. An
ECL Western blotting detection system (Amersham) was
used to detect the bound antibody. A detailed
protocol for this detection system was provided by the
manufacturer. Rainbow molecular weight markers
(Amersham) were used to estimate the size of fusion
proteins.

8.1.3. CELL FUSION ASSAYS FOR ANTI-HIV ACTIVITY
Cell fusion assays were performed as previously
described (Matthews et al., 1987, Proc. Natl. Acad.
Sci. USA 84: 5424-5481). CEM cells (7 x 10^4) were
incubated with HIV-1 IIIB chronically infected CEM cells
(10^4) in 96-well flat-bottomed half-area plates
(Costar) in 100 µl culture medium. Peptide and fusion
proteins at various concentrations in 10 µl culture
medium were incubated with the cell mixtures at 37°C
for 24 hours. Multinucleated syncytia were estimated
by microscopic examination. Neither M41 nor M41-P
showed cytotoxicity at the concentrations tested and
shown in FIG. 8.

Inhibition of HIV-1 induced cell-cell fusion
activity was carried out in the presence of 10 nM
DP178 and various concentrations of M41Δ178 or
M41-PΔ178 as indicated in FIG. 9. There were no observable
syncytia in the presence of 10 nM DP178. No peptide
or fusion protein was added in the control samples.

8.1.4. ELISA ANALYSIS OF DP178 BINDING
TO THE LEUCINE ZIPPER MOTIF OF GP41
The amino acid sequence of DP178 used is:
YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF. For enzyme
linked immunoassay (ELISA), M41Δ178 or M41-PΔ178 (5
µg/ml) in 0.1 M NaHCO3, pH 8.6, was coated on 96-well
Linbro ELISA plates (Flow Lab, Inc.) overnight. Each
well was washed three times with distilled water, then
blocked with 3% bovine serum albumin (BSA) for 2
hours. After blocking, peptides with 0.5% BSA in TBST
(40 mM Tris-HCl pH 7.5, 150 mM NaCl, 0.05% Tween 20)
were added to the ELISA plates and incubated at room
temperature for 1 hour. After washing three times
with TBST, Fab-d was added at a concentration of 10
ng/ml with 0.5% BSA in TBST. The plates were washed
three times with TBST after incubation at room
temperature for 1 hour. Horseradish peroxidase (HRP)
conjugated goat antihuman Fab antiserum at a 2000-fold
dilution in TBST with 0.5% BSA was added to each well
and incubated at room temperature for 45 minutes. The
plates were then washed four times with TBST. The
peroxidase substrate o-phenylene diamine (2.5 mg/ml)
and 0.15% H2O2 were added to develop the color. The
reaction was stopped with an equal volume of 4.5 N
H2SO4 after incubation at room temperature for 10
minutes. The optical density of the stopped reaction
mixture was measured with a microplate reader
(Molecular Design) at 490 nm. Results are shown in
FIG. 10.



8.2. RESULTS 

8.2.1. THE EXPRESSION AND CHARACTERIZATION
OF THE ECTODOMAIN OF GP41

As a step toward understanding the roles of the
two helical regions in gp41 structure and function,
the ectodomain of gp41 was expressed as a maltose
binding fusion protein (M41) (FIG. 7). The fusogenic
peptide sequence at the N-terminus of gp41 was omitted
from this recombinant protein and its derivatives to
improve solubility. The maltose binding protein
facilitated purification of the fusion proteins under
relatively mild, non-denaturing conditions. Because
the M41 soluble recombinant gp41 was not glycosylated,
lacked several regions of the transmembrane protein
(i.e., the fusion peptide, the membrane spanning, and
the cytoplasmic domains), and was expressed in the
absence of gp120, it was not expected to precisely
reflect the structure of native gp41 on HIV-1 virions.
Nevertheless, purified M41 folded in a manner that
preserved certain discontinuous epitopes as evidenced
by reactivity with human monoclonal antibodies, 98-6,
126-6, and 50-69, previously shown to bind
conformational epitopes on native gp41 expressed in
eukaryotic cells (Xu et al., 1991, J. Virol. 65: 4832-
4838; Chen, 1994, J. Virol. 68:2002-2010). Thus, at
least certain regions of native gp41 defined by these
antibodies appear to be reproduced in the recombinant
fusion protein M41. Furthermore, M41 reacted with a
human recombinant Fab (Fab-d) that recognizes a
conformational epitope on gp41 and binds HIV-1 virions
as well as HIV-1 infected cells but not uninfected
cells as analyzed by FACS. Deletion of either helix
motif, i.e., DP107 or DP178, of the M41 fusion protein
eliminated reactivity with Fab-d. These results
indicate that both helical regions, separated by 60
amino acids in the primary sequence, are required to
maintain the Fab-d epitope.

8.2.2. ANTI-HIV ACTIVITY OF THE
RECOMBINANT ECTODOMAIN OF GP41

The wild type M41 fusion protein was tested for
anti-HIV-1 activity. As explained, supra, synthetic
peptides corresponding to the leucine zipper (DP107)
and the C-terminal putative helix (DP178) show potent
anti-HIV activity. Despite inclusion of both these
regions, the recombinant M41 protein did not affect
HIV-1 induced membrane fusion at concentrations as
high as 50 µM (Table VII, below).
Table VII

DISRUPTION OF THE LEUCINE ZIPPER OF
GP41 FREES THE ANTI-HIV MOTIF

                          DP107    DP178    M41         M41-P       M41-PΔ178

Cell fusion (IC90)        1 µM     1 nM     >50 µM      83 nM       >50 µM

Fab-d binding (KD) (1)    -        -        3.5x10^-9   2.5x10^-8   -

HIV infectivity (IC90)    1 µM     80 nM    >16 µM      66 nM       >8 µM

(1) The affinity constants of Fab-d binding to the fusion proteins were determined
using a protocol described by B. Friguet et al., 1985, J. Immunol. Method.
77:305-319.

- = No detectable binding of Fab-d to the fusion proteins.

Antiviral Infectivity Assays. 20 µl of serially diluted virus stock was incubated
for 60 minutes at ambient temperature with 20 µl of the indicated concentration
of purified recombinant fusion protein in RPMI 1640 containing 10% fetal
bovine serum and antibiotics in a 96-well microtiter plate. 20 µl of CEM4 cells
at 6 x 10^5 cells/ml were added to each well, and cultures were incubated at 37°C
in a humidified CO2 incubator. Cells were cultured for 9 days by the addition of
fresh medium every 2 to 3 days. On days 5, 7, and 9 postinfection, supernatant
samples were assayed for reverse transcriptase (RT) activity, as described
below, to monitor viral replication. The 50% tissue culture infectious dose
(TCID50) was calculated for each condition according to the formula of Reed &
Muench, 1937, Am. J. Hyg. 27:493-497. RT activity was determined by a
modification of the published methods of Goff et al., 1981, J. Virol. 38:239-248
and Willey et al., 1988, J. Virol. 62:139-147 as described in Chen et al., 1993,
AIDS Res. Human Retroviruses 9:1079-1086.

Surprisingly, a single amino acid substitution,
proline in place of isoleucine in the middle of the
leucine zipper motif, yielded a fusion protein (M41-P)
which did exhibit antiviral activity (Table VII and
FIG. 8). As seen in Table VII, M41-P blocked syncytia
formation by 90% at approximately 85 nM and
neutralized HIV-1 IIIB infection by 90% at approximately
70 nM concentrations. The anti-HIV-1 activity of M41-P
appeared to be mediated by the C-terminal helical
sequence, since deletion of that region from M41-P
yielded an inactive fusion protein, M41-PΔ178 (Table
VII). This interpretation was reinforced by
experiments demonstrating that a truncated fusion
protein lacking the DP178 sequence, M41Δ178, abrogated
the potent anti-fusion activity of the DP178 peptide
in a concentration-dependent manner (FIG. 9). The
same truncated fusion protein containing the proline
mutation disrupting the leucine zipper, M41-PΔ178, was
not active in similar competition experiments (FIG.
9). The results indicate that the DP178 peptide
associates with a second site on gp41 whose
interactive structure is dependent on a wild type
leucine zipper sequence. A similar interaction may
occur within the wild type fusion protein, M41, and
act to form an intramolecular clasp which sequesters
the DP178 region, making it unavailable for anti-viral
activity.

A specific association between these two domains
is also indicated by other human monoclonal Fab-d
studies. For example, Fab-d failed to bind either the
DP178 peptide or the fusion protein M41Δ178, but its
epitope was reconstituted by simply mixing these two
reagents together (FIG. 10). Again, the fusion protein
carrying the proline mutation in the leucine zipper
domain, M41-PΔ178, failed to reconstitute the epitope
in similar mixing experiments.
9. EXAMPLE: METHOD FOR COMPUTER-ASSISTED
IDENTIFICATION OF DP107-LIKE
AND DP178-LIKE SEQUENCES

A number of known coiled-coil sequences have been
well described in the literature and contain heptad
repeat positioning for each amino acid. Coiled-coil
nomenclature labels each of seven amino acids of a
heptad repeat A through G, with amino acids A and D
tending to be hydrophobic positions. Amino acids E
and G tend to be charged. These four positions (A, D,
E, and G) form the amphipathic backbone structure of a
monomeric alpha-helix. The backbones of two or more
amphipathic helices interact with each other to form
di-, tri-, tetrameric, etc., coiled-coil structures.
In order to begin to design computer search motifs, a
series of well characterized coiled coils were chosen
including yeast transcription factor GCN4, Influenza
Virus hemagglutinin loop 36, and human proto-oncogenes
c-Myc, c-Fos, and c-Jun. For each peptide sequence, a
strict homology for the A and D positions, and a list
of the amino acids which could be excluded for the B,
C, E, F, and G positions (because they are not
observed in these positions) was determined. Motifs
were tailored to the DP107 and DP178 sequences by
deducing the most likely possibilities for heptad
positioning of the amino acids of HIV-1 Bru DP-107,
which is known to have coiled-coil structure, and
HIV-1 Bru DP178, which is still structurally undefined.
The analysis of each of the sequences is contained in
FIG. 12. For example, the motif for GCN4 was designed
as follows:

1. The only amino acids (using standard single
letter amino acid codes) found in the A or D
positions of GCN4 were [LMNV].

2. All amino acids were found at B, C, E, F, and G
positions except {CFGIMPTW}.

3. The PESEARCH motif would, therefore, be written
as follows:

[LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
[LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
[LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
[LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)

Translating or reading the motif: "at the first A
position either L, M, N, or V must occur; at positions
B and C (the next two positions) accept everything
except C, F, G, I, M, P, T, or W; at the D position
either L, M, N, or V must occur; at positions E, F,
and G (the next 3 positions) accept everything except
C, F, G, I, M, P, T, or W." This statement is
contained four times in a 28-mer motif and five times
in a 35-mer motif. The basic motif key then would be
[LMNV]-{CFGIMPTW}. The motif keys for the remaining
well described coiled-coil sequences are summarized in
FIG. 12.
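The reading rule above maps directly onto an ordinary regular expression. The following is a minimal sketch and not the PESEARCH program itself: the function names are hypothetical, Python's `re` module stands in for the original search tool, and the test sequence is a commonly quoted GCN4 leucine zipper peptide that is not taken from this document.

```python
import re


def motif_key_to_regex(ad_allowed, other_excluded, heptads=4):
    """Expand a motif key such as [LMNV]-{CFGIMPTW} into a compiled regex.

    Each heptad is read as: A, then B and C, then D, then E, F and G.
    heptads=4 gives a 28-mer motif; heptads=5 gives a 35-mer motif.
    """
    allowed = "[" + ad_allowed + "]"        # A and D positions
    others = "[^" + other_excluded + "]"    # B, C, E, F, G positions
    heptad = allowed + others + "{2}" + allowed + others + "{3}"
    return re.compile("(?:" + heptad + "){" + str(heptads) + "}")


def find_hits(pattern, sequence):
    """Return 1-based inclusive (start, end) pairs for every match,
    trying each start position so that overlapping hits are reported."""
    hits = []
    for start in range(len(sequence)):
        match = pattern.match(sequence, start)
        if match:
            hits.append((start + 1, match.end()))
    return hits


gcn4_motif = motif_key_to_regex("LMNV", "CFGIMPTW", heptads=4)

# Illustrative input only (assumed GCN4 leucine zipper peptide).
gcn4_peptide = "RMKQLEDKVEELLSKNYHLENEVARLKKLVGER"
gcn4_hits = find_hits(gcn4_motif, gcn4_peptide)
print(gcn4_motif.pattern)
print(gcn4_hits)
```

With this input the 28-mer motif matches residues 2 through 29 of the test peptide, with M, L, V, L, N, L, V and L falling in the A and D positions.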

The motif design for DP107 and DP178 was slightly
different from the 28-mer model sequences described
above because heptad repeat positions are
not defined and the peptides are both longer than 28
residues. FIG. 13 illustrates several possible
sequence alignments for both DP107 and DP178 and also
includes motif designs based on 28-mer, 35-mer, and
full-length peptides. Notice that only slight
differences occur in the motifs as the peptides are
lengthened. Generally, lengthening the base peptide
results in a less stringent motif. This is very
useful in broadening the possibilities for identifying
DP107- or DP178-like primary amino acid sequences,
referred to in this document as "hits".

In addition to making highly specific motifs for
each type of peptide sequence to be searched, it is also
possible to make "hybrid" motifs. These motifs are
made by "crossing" two or more very stringent motifs
to make a new search algorithm which will find not
only both "parent" motif sequences but also any
peptide sequences which have similarities to one, the
other, or both "parents". For example, in FIG. 14 the
"parent" sequence of GCN4 is crossed with each of the
possible "parent" motifs of DP-107. Now the hybrid
motif must contain all of the amino acids found in the
A and D positions of both parents, and exclude all of
the amino acids not found in either parent at the
other positions. The resulting hybrid from crossing
GCN4 or [LMNV] {CFGIMPTW} and DP107 (28-mer with the
first L in the D position) or [ILQT] {CDFIMPST}, is
[ILMNQTV] {CFIMPT}. Notice that now only two basic
hybrid motifs exist which cover both framing
possibilities, as well as all peptide lengths of the
parent DP-107 molecule. FIG. 15 represents the
"hybridizations" of GCN4 with DP-178. FIG. 16
represents the "hybridizations" of DP107 and DP178.
It is important to keep in mind that the represented
motifs, both parent and hybrid, are motif keys and not
the depiction of the full-length motif needed to
actually do the computer search.
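The crossing rule can be written as two set operations: a union over the A/D sets and an intersection over the excluded sets. The sketch below is illustrative only; the function name and the tuple representation of a motif key are assumptions, and the two parent keys are the ones quoted in the paragraph above.

```python
def hybridize(*motif_keys):
    """Cross two or more motif keys.

    Each key is a pair (ad_allowed, other_excluded) of strings.
    The hybrid allows at A/D every residue allowed by any parent, and
    excludes at the other positions only residues excluded by all parents.
    """
    if not motif_keys:
        raise ValueError("at least one motif key is required")
    allowed = set()
    excluded = None
    for ad_allowed, other_excluded in motif_keys:
        allowed |= set(ad_allowed)
        if excluded is None:
            excluded = set(other_excluded)
        else:
            excluded &= set(other_excluded)
    return "".join(sorted(allowed)), "".join(sorted(excluded))


gcn4_key = ("LMNV", "CFGIMPTW")
dp107_key = ("ILQT", "CDFIMPST")  # 28-mer, first L in the D position

hybrid_key = hybridize(gcn4_key, dp107_key)
print("[%s] {%s}" % hybrid_key)
```

For the two parents shown, this reproduces the hybrid key [ILMNQTV] {CFIMPT} given in the text. Because every added parent can only enlarge the allowed set and shrink the excluded set, each further cross yields a motif that is no more stringent than its parents.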

Hybridizations can be performed on any
combination of two or more motifs. FIG. 17
summarizes several three-motif hybridizations
including GCN4, DP107 (both frames), and DP178 (also
both frames). Notice that the resulting motifs are
now becoming much more similar to each other. In
fact, the first and third hybrid motifs are actually
subsets of the second and fourth hybrid motifs,
respectively. This means that the first and third
hybrid motifs are slightly more stringent than the
second and fourth. It should also be noted that with
only minor changes in these four motifs, or by
hybridizing them, a single motif could be obtained
which would find all of the sequences. However, it
should be remembered that stringency is also reduced.
Finally, the most broad-spectrum and least-stringent
hybrid motif is described in FIG. 18, which summarizes
the hybridization of GCN4, DP107 (both frames), DP178
(both frames), c-Fos, c-Jun, c-Myc, and Flu loop 36.

A special set of motifs was designed based on the
fact that DP-178 is located only approximately ten
amino acids upstream of the transmembrane spanning
region of gp41 and just C-terminal to a proline which
separates DP107 and DP178. It has been postulated
that DP178 may be an amphipathic helix when membrane
associated, and that the proline might aid in the
initiation of the helix formation. The same
arrangement was observed in Respiratory Syncytial
Virus; however, the DP178-like region in this virus
also had a leucine zipper just C-terminal to the
proline. Therefore, N-terminal proline-leucine zipper
motifs were designed to analyze whether any other
viruses might contain this same pattern. The motifs
are summarized in FIG. 19.

The PC/Gene protein database contains 5879 viral
amino acid sequences (library file PVIRUSES; CD-ROM
release 11.0). Of these, 1092 are viral envelope or
glycoprotein sequences (library file PVIRUSE1).
Tables V through XIV contain lists of protein sequence
names and motif hit locations for all the motifs
searched.

10. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107 AND DP178-LIKE SEQUENCES
IN HUMAN IMMUNODEFICIENCY VIRUS

FIG. 20 represents search results for HIV-1 BRU
isolate gp41 (PC/Gene protein sequence PENV_HV1BR).
Notice that the hybrid motif which crosses DP-107 and
DP-178 (named 107x178x4; the same motif as found in
FIG. 16) found three hits including amino acids 550-
599, 636-688, and 796-823. These areas include DP-107
plus eight N-terminal and four C-terminal amino acids;
DP178 plus seven N-terminal and ten C-terminal amino
acids; and an area inside the transmembrane region
(cytoplasmic). FIG. 20 also contains the results
obtained from searching with the motif named ALLMOTI5,
for which the key is found in FIG. 17 ({CDGHP}
{CFP}x5). This motif also found three hits including
DP107 (amino acids 510-599), DP178 (615-717), and a
cytoplasmic region (772-841). These hits overlap the
hits found by the motif 107x178x4 with considerable
additional sequences on both the amino and carboxy
termini. This is not surprising in that 107x178x4 is
a subset of the ALLMOTI5 hybrid motif. Importantly,
even though the stringency of ALLMOTI5 is considerably
less than 107x178x4, it still selectively identifies
the DP107 and DP178 regions of gp41 shown to contain
sequences for inhibitory peptides of HIV-1. The
results of these two motif searches are summarized in
Table V of U.S. Patent Application Serial No.
08/470,896 filed on June 6, 1995 (incorporated herein
by reference in its entirety) under the PC/Gene
protein sequence name PENV_HV1BR. The proline-leucine
zipper motifs also gave several hits in HIV-1 BRU
including 503-525, which is at the very C-terminus of
gp120, just upstream of the cleavage site (P7LZIPC and
P12LZIPC); and 735-768 in the cytoplasmic domain of
gp41 (P23LZIPC). These results are found in Tables
VIII, IX, and X under the same sequence name as
mentioned above. Notice that the only area of HIV-1
BRU which is predicted by the Lupas algorithm to
contain a coiled-coil region is from amino acids 635-
670. This begins eight amino acids N-terminal to the
start and ends eight amino acids N-terminal to the end
of DP178. DP107, despite the fact that it is a known
coiled coil, is not predicted to contain a coiled-coil
region using the Lupas method.



11. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107-LIKE AND DP178-LIKE
SEQUENCES IN HUMAN RESPIRATORY
SYNCYTIAL VIRUS

FIG. 21 represents search results for Human
Respiratory Syncytial Virus (RSV; Strain A2) fusion
glycoprotein F1 (PC/Gene protein sequence name
PVGLF_HRSVA). Motif 107x178x4 finds three hits including
amino acids 152-202, 213-243, and 488-515. The
arrangement of these hits is similar to what is found
in HIV-1 except that the motif finds two regions with
similarities to DP-178, one just downstream of what
would be called the DP107 region or amino acids 213-
243, and one just upstream of the transmembrane region
(also similar to DP178) or amino acids 488-515. Motif
ALLMOTI5 also finds three areas including amino acids
116-202, 267-302, and 506-549. The proline-leucine
zipper motifs also gave several hits including amino
acids 205-221 and 265-287 (P1LZIPC 265-280, P12LZIPC),
and 484-513 (P7LZIPC and P12LZIPC 484-506, P23LZIPC).
Notice that the PLZIP motifs also identify regions
which share location similarities with DP-178 of
HIV-1.

12. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN SIMIAN IMMUNODEFICIENCY VIRUS

Motif hits for Simian Immunodeficiency Virus gp41
(AGM3 isolate; PC/Gene protein sequence name
PENV_SIVAG) are shown in FIG. 22. Motif 107x178x4
finds three hits including amino acids 566-593, 597-
624, and 703-730. The first two hits only have three
amino acids between them and could probably be
combined into one hit from 566-624 which would
represent a DP107-like hit. Amino acids 703 to 730
would then represent a DP178-like hit. ALLMOTI5 also
finds three hits including amino acids 556-628 (DP107-
like), 651-699 (DP178-like), and 808-852, which
represents the transmembrane spanning region. SIV
also has one region from 655-692 with a high
propensity to form a coiled coil as predicted by the
Lupas algorithm. Both 107x178x4 and ALLMOTI5 motifs
find the same region. SIV does not have any PLZIP
motif hits in gp41.
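Combining neighboring hits, as suggested above for the two 107x178x4 hits at 566-593 and 597-624, amounts to merging intervals separated by a small gap. The sketch below is illustrative only: the function name is hypothetical, and the three-residue gap threshold is an assumption chosen to match the SIV case rather than a rule stated in the text.

```python
def merge_hits(hits, max_gap=3):
    """Merge 1-based inclusive (start, end) hits whose separation is at
    most max_gap residues. max_gap=3 is an assumed threshold."""
    merged = []
    for start, end in sorted(hits):
        if merged and start - merged[-1][1] - 1 <= max_gap:
            # Close enough to the previous hit: extend it.
            previous_start, previous_end = merged[-1]
            merged[-1] = (previous_start, max(previous_end, end))
        else:
            merged.append((start, end))
    return merged


siv_107x178x4_hits = [(566, 593), (597, 624), (703, 730)]
siv_merged = merge_hits(siv_107x178x4_hits)
print(siv_merged)
```

Applied to the SIV hits, this yields one DP107-like region at 566-624 and leaves the DP178-like hit at 703-730 unchanged.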

The identification of DP178/DP107 analogs for a
second SIV isolate (MM251) is demonstrated in the
Example presented below in Section 19.

13. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN CANINE DISTEMPER VIRUS

Canine Distemper Virus (strain Onderstepoort)
fusion glycoprotein F1 (PC/Gene protein sequence name
PVGLF_CDVO) has regions similar to Human RSV which are
predicted to be DP107-like and DP178-like (FIG. 23).
Motif 107x178x4 highlights one area just C-terminal to
the fusion peptide at amino acids 252-293. Amino
acids 252-286 are also predicted to be coiled coil
using the Lupas algorithm. Almost 100 amino acids
C-terminal to the first region is a DP178-like area at
residues 340-367. ALLMOTI5 highlights three areas of
interest including: amino acids 228-297, which
completely overlaps both the Lupas prediction and the
DP107-like 107x178x4 hit; residues 340-381, which
overlaps the second 107x178x4 hit; and amino acids
568-602, which is DP178-like in that it is located
just N-terminal to the transmembrane region. It also
overlaps another region (residues 570-602) predicted
by the Lupas method to have a high propensity to form
a coiled coil. Several PLZIP motifs successfully
identified areas of interest including P6 and P12LZIPC,
which highlight residues 336-357 and 336-361,
respectively; P1 and P12LZIPC, which find residues 398-
414; and P12 and P23LZIPC, which find residues 562-589
and 562-592, respectively.

14. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN NEWCASTLE DISEASE VIRUS

FIG. 24 shows the motif hits found in Newcastle
Disease Virus (strain Australia-Victoria/32; PC/Gene
protein sequence name PVGLF_NDVA). Motif 107x178x4
finds two areas including a DP107-like hit at amino
acids 151-178 and a DP178-like hit at residues 426-
512. ALLMOTI5 finds three areas including residues
117-182, 231-272, and 426-512. The hits from 426-512
include a region which is predicted by the Lupas
method to have a high coiled-coil propensity (460-
503). The PLZIP motifs identify only one region of
interest at amino acids 273-289 (P1 and P12LZIPC).

15. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107-LIKE AND DP178-LIKE
SEQUENCES IN HUMAN PARAINFLUENZA VIRUS

Both motifs 107x178x4 and ALLMOTI5 exhibit
DP107-like hits in the same region, 115-182 and 117-
182 respectively, of Human Parainfluenza Virus (strain
NIH 47885; PC/Gene protein sequence name PVGLF_PI3H4;
FIG. 25). In addition, the two motifs have a DP178-
like hit just slightly C-terminal at amino acids 207-
241. Both motifs also have DP178-like hits nearer the
transmembrane region including amino acids 457-497 and
462-512, respectively. Several PLZIP motif hits are
also observed including 283-303 (P5LZIPC), 283-310
(P12LZIPC), 453-474 (P6LZIPC), and 453-481 (P23LZIPC).
The Lupas algorithm predicts that amino acids 122-176
may have a propensity to form a coiled coil.

16. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES OF
INFLUENZA A VIRUS

FIG. 26 illustrates the Lupas prediction for a
coiled coil in Influenza A Virus (strain A/Aichi/2/68)
at residues 379-436, as well as the motif hits for
107x178x4 at amino acids 387-453, and for ALLMOTI5 at
residues 380-456. Residues 383-471 (38-125 of HA2)
were shown by Carr and Kim to be an extended coiled
coil when under acidic pH (Carr and Kim, 1993, Cell
73: 823-832). The Lupas algorithm predicts a coiled
coil at residues 379-436. All three methods
successfully predicted the region shown to actually
have coiled-coil structure; however, ALLMOTI5
predicted the greatest portion of the 88 residue
stretch.
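The comparison of the three predictions can be made explicit by counting how many residues of the experimentally defined region each one covers. This is a minimal sketch using only the residue ranges quoted above; the function and variable names are hypothetical, and ranges are treated as 1-based and inclusive.

```python
def overlap(region_a, region_b):
    """Number of residues shared by two 1-based inclusive ranges."""
    start = max(region_a[0], region_b[0])
    end = min(region_a[1], region_b[1])
    return max(0, end - start + 1)


observed_coiled_coil = (383, 471)  # range quoted from Carr and Kim
predictions = {
    "Lupas": (379, 436),
    "107x178x4": (387, 453),
    "ALLMOTI5": (380, 456),
}

covered = {name: overlap(region, observed_coiled_coil)
           for name, region in predictions.items()}
best_method = max(covered, key=covered.get)
print(covered, best_method)
```

On these ranges the Lupas prediction covers 54 residues of the observed region, 107x178x4 covers 67, and ALLMOTI5 covers 74.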

17. EXAMPLE: POTENTIAL RESPIRATORY SYNCYTIAL VIRUS
DP178/DP107 ANALOGS: CD AND
ANTIVIRAL CHARACTERIZATION

In the Example presented herein, respiratory
syncytial virus (RSV) peptides identified by utilizing
the computer-assisted search motifs described in the
Examples presented in Sections 9 and 11, above, were
tested for anti-RSV activity. Additionally, circular
dichroism (CD) structural analyses were conducted on
the peptides, as discussed below. It is demonstrated
that several of the identified peptides exhibit potent
antiviral capability. Additionally, it is shown that
several of these peptides exhibit a substantial
helical character.

17.1 MATERIALS AND METHODS
Structural analyses: The CD spectra were
measured in a 10 mM sodium phosphate, 150 mM sodium
chloride, pH 7.0, buffer at approximately 10 mM
concentrations, using a 1 cm pathlength cell on a
Jobin/Yvon Autodichrograph Mark V CD
spectrophotometer. Peptides were synthesized
according to the methods described above in Section
6.1. Peptide concentrations were determined from A280
using Edelhoch's method (1967, Biochemistry 5:1948).

Anti-RSV antiviral activity assays: The assay
utilized herein tested the ability of the peptides to
disrupt the ability of Hep-2 cells acutely infected
with RSV (i.e., cells which are infected with a
multiplicity of infection of greater than 2) to fuse
and cause syncytial formation on a monolayer of
an uninfected line of Hep-2 cells. The
lower the observed level of fusion, the greater the
antiviral activity of the peptide was determined to
be.

Uninfected confluent monolayers of Hep-2 cells
were grown in microtiter wells in 3% EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 3%, antibiotics (penicillin/streptomycin; Bio
Whittaker Cat. No. 17-602E) added at 1%, and glutamine
added at 1%).

To prepare Hep-2 cells for addition to uninfected
cells, cultures of acutely infected Hep-2 cells were
washed with DPBS (Dulbecco's Phosphate Buffered Saline
w/o calcium or magnesium; Bio Whittaker Cat. No. 17-
512F) and cell monolayers were removed with Versene
(1:5000; Gibco Life Technologies Cat. No. 15040-017).
The cells were spun 10 minutes and resuspended in 3%
FBS. Cell counts were performed using a
hemacytometer. Persistent cells were added to the
uninfected Hep-2 cells.

The antiviral assay was conducted by, first, 
removing all media from the wells containing 

25 uninfected Hep-2 cells, then adding peptides (at the 
dilutions described below) in 3% EMEM, and 100 acutely 
RSV- infected Hep2 cells per well. Wells were then 
incubated at 37°C for 48 hours. 

After incubation, cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of either
Crystal Violet stain or XTT. With respect to Crystal
Violet, approximately 50μl 0.25% Crystal Violet stain
in methanol was added to each well. The wells were
rinsed immediately, to remove excess stain, and were
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.

With respect to XTT (2,3-bis[2-Methoxy-4-nitro-5-
sulfophenyl]-2H-tetrazolium-5-carboxyanilide inner
salt), 50μl XTT (1mg/ml in RPMI buffered with 100mM
HEPES, pH 7.2-7.4, plus 5% DMSO) was added to each
well. The OD450/690 was measured (after blanking against
growth medium without cells or reagents, and against
reagents) according to standard procedures.
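
The blanked dual-wavelength reading described above can be illustrated with a minimal sketch. The text does not state how the 450 nm and 690 nm readings are combined, so the subtraction below (signal wavelength minus reference wavelength, then minus the same difference for the blank well) is an assumption based on common plate-reader practice, and the function name and numeric values are hypothetical.

```python
def corrected_od(od450, od690, blank450=0.0, blank690=0.0):
    """Blank-corrected dual-wavelength optical density.

    Assumption (not stated in the text): the 690 nm reference reading is
    subtracted from the 450 nm reading, and the same difference measured
    in a blank well is then subtracted from the sample value.
    """
    return (od450 - od690) - (blank450 - blank690)


# Hypothetical readings for one sample well and one blank well.
sample = corrected_od(od450=0.85, od690=0.05, blank450=0.10, blank690=0.04)
```

With these illustrative numbers the corrected value is 0.74; real plates would supply one such value per well.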

Peptides: The peptides characterized in the
study presented herein were:

1) peptides T-142 to T-155 and T-575, as shown in FIG.
27A, and peptides T-22 to T-27, T-68, T-334 and T-371
to T-375 and T-575, as shown in FIG. 27B;

2) peptides T-120 to T-141 and T-576, as shown in FIG.
27C, and peptides T-12, T-13, T-15, T-19, T-28 to
T-30, T-66, T-69, T-70 and T-576, as shown in FIG. 27D;
and

3) peptides T-67 and T-104 to T-119 and T-384, as
shown in FIG. 28A, and peptides T-71, T-613 to T-617,
T-662 to T-676 and T-730, as shown in FIG. 28B.

The peptides of group 1 represent portions of the
RSV F2 protein DP178/107-like region. The peptides of
group 2 represent portions of the RSV F1 protein
DP107-like region. The peptides of group 3 represent
portions of the RSV F1 protein DP178-like region.

Each peptide was tested at 2-fold serial
dilutions ranging from 100μg/ml to approximately
100ng/ml. For each of the assays, a well containing
no peptide was also used. The IC50 data for each
peptide represent the average of several experiments
conducted utilizing that peptide.
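
As an illustration of the dilution scheme just described, the following sketch generates a 2-fold series starting at 100μg/ml and stopping at the first concentration at or below 100ng/ml. The function name and the choice of ng/ml as the working unit are assumptions made for the example; the text does not say how many wells were actually used.

```python
def twofold_series(start, floor):
    """Return start, start/2, start/4, ... ending at the first value <= floor."""
    series = [start]
    while series[-1] > floor:
        series.append(series[-1] / 2)
    return series


# 100 ug/ml expressed in ng/ml, halved until roughly 100 ng/ml is reached.
rsv_series_ng_per_ml = twofold_series(100_000.0, 100.0)
```

Ten successive halvings (eleven concentrations in total) take 100μg/ml down to about 97.7ng/ml, which matches the "approximately 100ng/ml" endpoint.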

17.2 RESULTS 
The data summarized in FIGS. 27A-D and 28A-B
represent antiviral and structural information
obtained from peptides derived from the RSV
DP178/DP107-like F2 region (FIG. 27A-B), the RSV F1
DP107-like region (FIG. 27C-D) and the RSV F1
DP178-like region (FIG. 28A-B).

As shown in FIGS. 27A-D, a number of the RSV
DP178/DP107-like peptides exhibited a detectable level
of antiviral activity. Peptides from the RSV
DP178/DP107-like F2 region (FIG. 27A-B), for example,
T-142 to T-145 and T-334 purified peptides, exhibited
detectable levels of antiviral activity, as evidenced
by their IC50 values. Further, a number of RSV F1
DP107-like peptides (FIG. 27C-D) exhibited a sizable
level of antiviral activity as purified peptides,
including, for example, peptides T-124 to T-127,
T-131, T-135 and T-137 to T-139, as demonstrated by
their low IC50 values. In addition, CD analysis (FIG.
27A, 27C) reveals that many of the peptides exhibit
some detectable level of helical structure.

The results summarized in FIG. 28A-B demonstrate
that a number of DP178-like purified peptides exhibit
a range of potent anti-viral activity. These peptides
include, for example, T-67, T-104, T-105 and T-107 to
T-119, as listed in FIG. 28A, and T-665 to T-669 and
T-671 to T-673, as listed in FIG. 28B. In addition,
some of the DP178-like peptides exhibited some level
of helicity.

Thus, the computer assisted searches described,
hereinabove, successfully identified viral peptide
domains that represent highly promising anti-RSV
antiviral compounds.

18. EXAMPLE: POTENTIAL HUMAN PARAINFLUENZA VIRUS
TYPE 3 DP178/DP107 ANALOGS: CD AND
ANTIVIRAL CHARACTERIZATION

In the Example presented herein, human
parainfluenza virus type 3 (HPIV3) peptides identified
by utilizing the computer-assisted search motifs
described in the Examples presented in Sections 9 and
15, above, were tested for anti-HPIV3 activity.
Additionally, circular dichroism (CD) structural
analyses were conducted on the peptides, as discussed
below. It is demonstrated that several of the
identified peptides exhibit potent antiviral
capability. Additionally, it is shown that several of
these peptides exhibit a substantial helical
character.



18.1 MATERIALS AND METHODS

Structural analyses: Structural analyses
consisted of circular dichroism (CD) studies. The CD
spectra were measured in a 10mM sodium phosphate,
150mM sodium chloride, pH 7.0, buffer at approximately
10mM concentrations, using a 1 cm pathlength cell on a
Jobin/Yvon Autodichrograph Mark V CD
spectrophotometer. Peptide concentrations were
determined from A280 using Edelhoch's method (1967,
Biochemistry 6:1948).
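
The concentration determination from A280 can be sketched as a Beer-Lambert calculation. The per-residue molar extinction coefficients below (tryptophan 5690, tyrosine 1280, cystine 120 M^-1 cm^-1 at 280 nm) are the values commonly quoted for Edelhoch's method and are stated here as an assumption, since the text does not list them; the peptide sequence and absorbance are hypothetical.

```python
# Commonly quoted Edelhoch coefficients at 280 nm (M^-1 cm^-1); assumed here.
EPS_TRP = 5690.0
EPS_TYR = 1280.0
EPS_CYSTINE = 120.0


def molar_extinction_280(sequence, n_disulfides=0):
    """Estimate the 280 nm extinction coefficient from Trp, Tyr and cystine counts."""
    seq = sequence.upper()
    return (EPS_TRP * seq.count("W")
            + EPS_TYR * seq.count("Y")
            + EPS_CYSTINE * n_disulfides)


def molar_concentration(a280, sequence, path_cm=1.0, n_disulfides=0):
    """Beer-Lambert law: concentration (mol/l) = A280 / (epsilon * path length)."""
    eps = molar_extinction_280(sequence, n_disulfides)
    if eps == 0:
        raise ValueError("no Trp, Tyr or cystine: A280 cannot give a concentration")
    return a280 / (eps * path_cm)


# Hypothetical peptide with two Trp and one Tyr, read in a 1 cm cell.
example_conc = molar_concentration(0.1266, "AWYAW")
```

For this made-up peptide the estimated coefficient is 12660 M^-1 cm^-1, so an A280 of 0.1266 in a 1 cm cell corresponds to about 10 micromolar.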

Anti-HPIV3 antiviral activity assays: The assay
utilized herein tested the ability of the peptides to
disrupt the ability of Hep2 cells chronically infected
with HPIV3 to fuse and cause syncytial formation on a
monolayer of an uninfected line of CV-1W cells. The
lower the observed level of fusion, the greater the
antiviral activity of the peptide.

Uninfected confluent monolayers of CV-1W cells
were grown in microtiter wells in 3% EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 3%, antibiotics/antimycotics (Gibco BRL Life
Technologies Cat. No. 15040-017) added at 1%, and
glutamine added at 1%).

To prepare Hep2 cells for addition to uninfected 
cells, cultures of chronically infected Hep2 cells 
were washed with DPBS (Dulbecco's Phosphate Buffered 
Saline w/o calcium or magnesium; Bio Whittaker Cat. 
No. 17-512F) and cell monolayers were removed with 
Versene (1:5000; Gibco Life Technologies Cat. No.
15040-017). The cells were spun 10 minutes and
resuspended in 3% FBS. Cell counts were performed
using a hemacytometer. Persistent cells were added to
the uninfected CV-1W cells.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected CV-1W cells, then adding peptides (at the
dilutions described below) in 3% EMEM, and 500
chronically HPIV3-infected Hep2 cells per well. Wells
were then incubated at 37°C for 24 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50μl 0.25% Crystal Violet stain in
methanol. Wells were rinsed immediately, to remove
excess stain, and were then allowed to dry. The number
of syncytia per well was then counted, using a
dissecting microscope.

Alternatively, instead of Crystal Violet
analysis, cells were assayed with XTT, as described,
above, in Section 17.1.

Peptides: The peptides characterized in the
study presented herein were:

1) Peptides 157 to 188, as shown in FIG. 29A, and
peptides T-38 to T-40, T-42 to T-46 and T-582, as
shown in FIG. 29B. These peptides are derived
from the DP107 region of the HPIV3 F1 fusion
protein (represented by HPF3 107, as shown in
FIG. 29A); and

2) Peptides 189 to 210, as shown in FIG. 30A, and
T-269, T-626, T-383 and T-577 to T-579, as shown in
FIG. 30B. These peptides are primarily derived
from the DP178 region of the HPIV3 F1 fusion
protein (represented by HPF3 178, as shown in
FIG. 30A). Peptide T-626 contains two mutated
amino acid residues (represented by a shaded
background). Additionally, peptide T-577
represents F1 amino acids 65-100, T-578
represents F1 amino acids 207-242 and T-579
represents F1 amino acids 273-309.

Each peptide was tested at 2-fold serial
dilutions ranging from 500μg/ml to approximately
500ng/ml. For each of the assays, a well containing
no peptide was also used.
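
The text reports IC50 values but does not state how they were derived from the syncytia counts. One plausible approach, offered purely as a sketch under that assumption, is to express each well as percent inhibition relative to the no-peptide control and to interpolate on a log-concentration scale between the two dilutions that bracket 50% inhibition. All function names and the counts in the example are hypothetical.

```python
import math


def percent_inhibition(count, control_count):
    """Reduction in syncytia relative to the no-peptide control well, in percent."""
    return 100.0 * (1.0 - count / control_count)


def ic50_log_interp(concs, counts, control_count):
    """Interpolate the 50% inhibition point on a log10 concentration scale.

    Returns None when 50% inhibition is not bracketed by the tested range.
    """
    points = sorted(zip(concs, counts))  # ascending concentration
    inhib = [(c, percent_inhibition(n, control_count)) for c, n in points]
    for (c_lo, i_lo), (c_hi, i_hi) in zip(inhib, inhib[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical counts at four 2-fold dilutions (arbitrary units), control = 100.
example_ic50 = ic50_log_interp([1, 2, 4, 8], [90, 60, 40, 10], control_count=100)
```

Interpolating on the log scale suits 2-fold dilution series, where the tested concentrations are evenly spaced in log terms rather than linearly.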

18.2 RESULTS 

The data summarized in FIGS. 29A-C and 30A-B
represent antiviral and structural information
obtained from peptides derived from the HPIV3 fusion
protein DP107-like region (FIG. 29A-C) and the HPIV3
fusion protein DP178-like region (FIG. 30A-B).

As shown in FIG. 29A-B, a number of the HPIV3
DP107-like peptides exhibited potent levels of
antiviral activity. These peptides include, for
example, peptides T-40, T-172 to T-175, T-178, T-184
and T-185.

CD analysis reveals that a number of the peptides
exhibit a detectable to substantial level of helical
structure. The CD spectrum for one of the peptides,
184, which exhibits substantial helicity, is summarized
in FIG. 29C.

The results summarized in FIG. 30A-B demonstrate
that a number of the DP178-like peptides tested
exhibit a range of anti-viral activity. These
peptides include, for example, peptides 194 to 211, as
evidenced by their low IC50 values. In fact, peptides
201 to 205 exhibit IC50 values in the nanogram/ml
range. In addition, many of the DP178-like peptides
exhibited some level of helicity.

Thus, the computer assisted searches described,
hereinabove, have successfully identified viral
peptide domains that represent highly promising
anti-HPIV3 antiviral compounds.



19. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN SIMIAN
IMMUNODEFICIENCY VIRUS

FIG. 31 represents search results for SIV isolate
MM251 (PC/Gene® protein sequence PENV_SIVM2). Both
107x178x4 and ALLMOTI5 search motifs identified two
regions with similarities to DP107 and/or DP178.

The peptide regions found by 107x178x4 were
located at amino acid residues 156-215 and 277-289.
The peptide regions found by ALLMOTI5 were located at
amino acid residues 156-219 and 245-286. Both motifs,
therefore, identify similar regions.
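
The agreement between the two motifs can be made concrete by intersecting the reported residue ranges. The sketch below uses the SIV coordinates given above; the helper name and the pairing of the first and second regions from each motif are assumptions made for illustration.

```python
def overlap(a, b):
    """Return the inclusive residue range shared by two ranges, or None."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


# Residue ranges reported for SIV isolate MM251.
siv_107x178x4 = [(156, 215), (277, 289)]
siv_allmoti5 = [(156, 219), (245, 286)]

shared = [overlap(a, b) for a, b in zip(siv_107x178x4, siv_allmoti5)]
```

Here the shared ranges are residues 156-215 and 277-286, so each region found by one motif overlaps a region found by the other.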

Interestingly, the first SIV peptide region
(i.e., from amino acid residue 156 to approximately
amino acid residue 219) correlates with a DP107
region, while the second region identified (i.e., from
approximately amino acid residue 245 to approximately
amino acid residue 289) correlates with the DP178
region of HIV. In fact, an alignment of SIV isolate
MM251 and HIV isolate BRU, followed by a selection of
the best peptide matches for HIV DP107 and DP178,
reveals that the best matches are found within the
peptide regions identified by the 107x178x4 and
ALLMOTI5 search motifs.

It should be noted that a potential coiled-coil
region at amino acid residues 242-282 is predicted by
the Lupas program. This is similar to the observation
in HIV in which the coiled-coil is predicted by the
Lupas program to be in the DP178 rather than in the
DP107 region. It is possible, therefore, that SIV may
be similar to HIV in that it may contain a coiled-coil
structure in the DP107 region, despite such a
structure being missed by the Lupas algorithm.
Likewise, it may be that the region corresponding to a
DP178 analog in SIV may exhibit an undefined
structure, despite the Lupas program's prediction of a
coiled-coil structure.

20. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN EPSTEIN-BARR
VIRUS

The results presented herein describe the
identification of DP178/DP107 analogs within two
different Epstein-Barr Virus proteins. Epstein-Barr
is a human herpes virus which is the causative agent
of, for example, infectious mononucleosis (IM), and is
also associated with nasopharyngeal carcinomas (NPC),
Burkitt's lymphoma and other diseases. The virus
predominantly exists in the latent form and is
activated by a variety of stimuli.

FIG. 32 depicts the search motif results for the
Epstein-Barr Virus (Strain B95-8; PC/Gene® protein
sequence PVGLB_EBV) glycoprotein gp110 precursor
(gp115). The 107x178x4 motif identified two regions
of interest, namely the regions covered by amino acid
residues 95-122 and 631-658. One PZIP region was
identified at amino acid residues 732-752, which is most
likely a cytoplasmic region of the protein. The Lupas
algorithm predicts a coiled-coil structure for amino
acids 657-684. No ALLMOTI5 regions were identified.

FIG. 33 depicts the search motif results for the
Zebra (or EB1) trans-activator protein (BZLF1) of the
above-identified Epstein-Barr virus. This protein is
a transcription factor which represents the primary
mediator of viral reactivation. It is a member of the
b-ZIP family of transcription factors and shares
significant homology with the basic DNA-binding and
dimerization domains of the cellular oncogenes c-fos
and C/EBP. The Zebra protein functions as a
homodimer.

Search results demonstrate that the Zebra protein
exhibits a single region which is predicted to be
either of DP107 or DP178 similarity, and is found
between the known DNA binding and dimerization regions
of the protein. Specifically, this region is located
at amino acid residues 193-220, as shown in FIG. 33.
The Lupas program predicted no coiled-coil regions.

21. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN MEASLES VIRUS

FIG. 34 illustrates the motif search results for
the fusion protein F1 of measles virus, strain
Edmonston (PC/Gene® protein sequence PVGLF_MEASE),
successfully identifying DP178/DP107 analogs.

The 107x178x4 motif identifies a single region at
amino acid residues 228-262. The ALLMOTI5 search
motif identifies three regions, including amino acid
residues 116-184, 228-269 and 452-500. Three regions
containing proline residues followed by a leucine
zipper-like sequence were found beginning at proline
residues 214, 286 and 451.

The Lupas program identified two regions it
predicted had potential for coiled-coil structure,
which include amino acid residues 141-172 and 444-483.

22. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS IN HEPATITIS B 
VIRUS 

FIG. 35 depicts the results of a PZIP motif
search conducted on the Hepatitis B virus subtype AYW.
Two regions of interest within the major surface
antigen precursor S protein were identified. The
first lies just C-terminal to the proposed fusion
peptide of the major surface antigen (Hbs) which is
found at amino acid residues 174-191. The second
region is located at amino acid residues 233-267. The
Lupas program predicts no coiled-coil repeat regions.

In order to test the potential anti-HBV antiviral
activity of these DP178/DP107 analog regions, peptides
derived from the area around the analog regions are
synthesized, as shown in FIG. 52A-B. These peptides
represent one amino acid peptide "walks" through the
putative DP178/DP107 analog regions. The peptides are
synthesized according to standard Fmoc chemistry on
Rink amide MBHA resins to provide for carboxy terminal
blockade (Chang, C.D. and Meienhofer, J., 1978, Int. J.
Pept. Protein Res. 11:246-249; Fields, G.B. and Noble,
R.L., 1990, Int. J. Pept. Protein Res. 35:161-214).
Following complete synthesis, the peptide
amino-terminus is blocked through automated acetylation and
the peptide is cleaved with trifluoroacetic acid (TFA)
and the appropriate scavengers (King, D.S. et al.,
1990, Int. J. Pept. Res. 36:255-266). After cleavage,
the peptide is precipitated with ether and dried under
vacuum for 24 hours.
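
A one amino acid peptide "walk" of the kind referred to above can be sketched as a sliding window that advances one residue at a time along a region. The sequence and window length below are placeholders, not taken from FIG. 52A-B, and the function name is hypothetical.

```python
def peptide_walk(sequence, length, step=1):
    """List every window of `length` residues, advancing `step` residues each time."""
    return [sequence[i:i + length]
            for i in range(0, len(sequence) - length + 1, step)]


# Placeholder 9-residue region walked with 5-residue peptides.
walk = peptide_walk("ACDEFGHIK", length=5)
```

A region of n residues walked with peptides of length k at a step of one yields n - k + 1 overlapping peptides, five in this placeholder case.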

The anti-HBV activity of the peptides is tested
by utilizing standard assays to determine the test
peptide concentration required to cause an acceptable
(e.g., 90%) decrease in the amount of viral progeny
formed by cells exposed to an HBV viral inoculum.
Candidate antiviral peptides are further characterized
in model systems such as woodchuck tissue culture and
animal systems, prior to testing on humans.

23. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN SIMIAN
MASON-PFIZER MONKEY VIRUS

The results depicted herein illustrate the
results of search motifs conducted on the Simian
Mason-Pfizer monkey virus. The motifs reveal
DP178/DP107 analogs within the envelope (TM) protein
GP20, as shown in FIG. 36.

The 107x178x4 motif identifies a region at amino
acid residues 422-470. The ALLMOTI5 motif finds a region
at amino acid residues 408-474. The Lupas program
predicted a coiled-coil structure at amino acids
424-459.

24. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN BACTERIAL
PROTEINS

The results presented herein demonstrate the
identification of DP178/DP107 analogs corresponding
to sequences present in proteins of a variety of
bacterial species.

FIG. 37 depicts the search motif results for the
Pseudomonas aeruginosa fimbrial protein (Pilin). Two
regions were identified by motifs 107x178x4 and
ALLMOTI5. The regions located at amino acid residues
30-67 and 80-144 were identified by the 107x178x4
motif. The regions at amino acid residues 30-68 and
80-125 were identified by the ALLMOTI5 motif.

FIG. 38 depicts the search motif results for the
Neisseria gonorrhoeae fimbrial protein (Pilin). A
single region was identified by both the 107x178x4 and
the ALLMOTI5 motifs. The region located at amino acid
residues 66-97 was identified by the 107x178x4 motif.
The region located at amino acid residues 66-125 was
identified by the ALLMOTI5 search motif. No
coiled-coil regions were predicted by the Lupas program.

FIG. 39 depicts the search motif results for the
Hemophilus influenzae fimbrial protein (Pilin). A
single region was identified by both the 107x178x4 and
the ALLMOTI5 motifs. The region located at amino acid
residues 102-129 was identified by the 107x178x4
motif. The region located at amino acid residues
102-148 was identified by the ALLMOTI5 search motif. No
coiled-coil regions were predicted by the Lupas
program.

FIG. 40 depicts the search motif results for the
Staphylococcus aureus toxic shock syndrome Hemophilus
Influenza fimbrial protein (Pilin). A single region
was identified by both the 107x178x4 and the ALLMOTI5
motifs. The region located at amino acid residues
102-129 was identified by the 107x178x4 motif. The
region located at amino acid residues 102-148 was
identified by the ALLMOTI5 search motif. No
coiled-coil regions were predicted by the Lupas program.

FIG. 41 summarizes the motif search results
conducted on the Staphylococcus aureus enterotoxin
Type E protein. These results demonstrate the
successful identification of DP178/DP107 analogs
corresponding to peptide sequences within this
protein, as described below.

The ALLMOTI5 motif identified a region at amino
acid residues 22-27. The 107x178x4 motif identified
two regions, with the first at amino acid residues
26-69 and the second at 88-115. A P12LZIPC motif search
identified two regions, at amino acid residues 163-181
and 230-250.

The Lupas program predicted a region with a high
propensity for coiling at amino acid residues 25-54.
This sequence is completely contained within the first
region identified by both ALLMOTI5 and 107x178x4
motifs.

FIG. 42 depicts the search motif results
conducted on a second Staphylococcus aureus toxin,
enterotoxin A. Two regions were identified by the
ALLMOTI5 motif, at amino acid residues 22-70 and amino
acid residues 164-205. The 107x178x4 motif found two
regions, the first at amino acid residues 26-69 and
the second at amino acid residues 165-192. A P23LZIPC
motif search revealed a region at amino acid residues
216-250. No coiled-coil regions were predicted by the
Lupas program.

FIG. 43 shows the motif search results conducted
on the E. coli heat labile enterotoxin A protein,
demonstrating the identification of DP178/DP107
analogs corresponding to peptides located within this
protein. Two regions were identified by the ALLMOTI5
motif, with the first residing at amino acid residues
55-115, and the second residing at amino acid residues
216-254. The 107x178x4 motif identified a single
region at amino acid residues 78-105. No coiled-coil
regions were predicted by the Lupas program.

25. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS WITHIN VARIOUS 
HUMAN PROTEINS 

The results presented herein demonstrate the
identification of DP178/DP107 analogs corresponding to
peptide sequences present within several different
human proteins.

FIG. 44 illustrates the search motif results
conducted on the human c-fos oncoprotein. The
ALLMOTI5 motif identified a single region at amino
acid residues 155-193. The 107x178x4 motif identified
one region at amino acid residues 162-193. The Lupas
program predicted a region at amino acid residues
148-201 to have coiled-coil structure.

FIG. 45 illustrates the search motif results
conducted on the human lupus KU autoantigen protein
P70. The ALLMOTI5 motif identified a single region at
amino acid residues 229-280. The 107x178x4 motif
identified one region at amino acid residues 235-292.
The Lupas program predicted a region at amino acid
residues 232-267 to have coiled-coil structure.

FIG. 46 illustrates the search motif results
conducted on the human zinc finger protein 10. The
ALLMOTI5 motif identified a single region at amino
acid residues 29-81. The 107x178x4 motif identified
one region at amino acid residues 29-56. A P23LZIPC
motif search found a single region at amino acid
residues 420-457. The Lupas program predicted no
coiled-coil regions.

26. EXAMPLE: POTENTIAL MEASLES VIRUS DP178/DP107
ANALOGS: CD AND ANTIVIRAL
CHARACTERIZATION

In the Example presented herein, measles (MeV)
virus DP178-like peptides identified by utilizing the
computer-assisted search motifs described in the
Examples presented in Sections 9 and 21, above, are
tested for anti-MeV activity. Additionally, circular
dichroism (CD) structural analyses are conducted on
the peptides, as discussed below. It is demonstrated
that several of the identified peptides exhibit potent
antiviral capability. Additionally, it is shown that
none of these peptides exhibits a substantial
helical character.

26.1 MATERIALS AND METHODS 
Structural analyses: The CD spectra were
measured in a 10mM sodium phosphate, 150mM sodium
chloride, pH 7.0, buffer at approximately 10mM
concentrations, using a 1 cm pathlength cell on a
Jobin/Yvon Autodichrograph Mark V CD
spectrophotometer. Peptide concentrations were
determined from A280 using Edelhoch's method (1967,
Biochemistry 6:1948).

Anti-MeV antiviral activity syncytial reduction
assay: The assay utilized herein tested the ability
of the peptides to disrupt the ability of Vero cells
acutely infected with MeV (i.e., cells which are
infected with a multiplicity of infection of 2-3) to
fuse and cause syncytial formation on a monolayer of
an uninfected line of Vero cells. The more potent the
peptide, the lower the observed level of fusion, the
greater the antiviral activity of the peptide.

Uninfected confluent monolayers of Vero cells
were grown in microtiter wells in 10% FBS EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 10%, antibiotics/antimycotics (Bio Whittaker Cat.
No. 17-602E) added at 1%, and glutamine added at 1%).

To prepare acutely infected Vero cells for
addition to the uninfected cells, cultures of acutely
infected Vero cells were washed twice with HBSS (Bio
Whittaker Cat. No. 10-543F) and cell monolayers were
removed with trypsin (Bio Whittaker Cat. No. 17-161E).
Once cells detached, media was added, any remaining
clumps of cells were dispersed, and hemacytometer cell
counts were performed.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected Vero cells, then adding peptides (at the
dilutions described below) in 10% FBS EMEM, and 50-100
acutely MeV-infected Vero cells per well. Wells were
then incubated at 37°C for a maximum of 18 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50μl 0.25% Crystal Violet stain in
methanol. Wells were rinsed twice with water
immediately, to remove excess stain, and were then
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.

Anti-MeV antiviral activity plaque reduction
assay: The assay utilized herein tested the ability
of the peptides to disrupt the ability of MeV to
infect permissive, uninfected Vero cells, leading to
the infected cells' fusing with uninfected cells to
produce syncytia. The lower the observed level of
syncytial formation, the greater the antiviral
activity of the peptide.

Monolayers of uninfected Vero cells are grown as
described above.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected Vero cells, then adding peptides (at the
dilutions described below) in 10% FBS EMEM, and MeV
stock virus at a final concentration of 30 plaque
forming units (PFU) per well. Wells were then
incubated at 37°C for a minimum of 36 hours and a
maximum of 48 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50μl 0.25% Crystal Violet stain in
methanol. Wells were rinsed twice with water
immediately, to remove excess stain, and were then
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.

Peptides: The peptides characterized in the
study presented herein were peptides T-252A0 to
T-256A0, T-257B1/C1, and T-258B1 to T-265B0, and T-266A0
to T-268A0, as shown in FIG. 47. These peptides
represent a walk through the DP178-like region of the
MeV fusion protein.

Each peptide was tested at 2-fold serial
dilutions ranging from 100μg/ml to approximately
100ng/ml. For each of the assays, a well containing
no peptide was also used.

26.2 RESULTS

The data summarized in FIG. 47 represent
antiviral and structural information obtained via
"peptide walks" through the DP178-like region of the
MeV fusion protein.

As shown in FIG. 47, the MeV DP178-like peptides
exhibited a range of antiviral activity as crude
peptides. Several of these peptides were chosen for
purification and further antiviral characterization.
The IC50 values for such peptides were determined, as
shown in FIG. 47, and ranged from 1.35μg/ml
(T-257B1/C1) to 0.072μg/ml (T-265B1). None of the
DP178-like peptides showed, by CD analysis, a detectable
level of helicity.

Thus, the computer assisted searches described,
hereinabove, as in, for example, the Example presented
in Section 9, successfully identified
viral peptide domains that represent highly promising
anti-MeV antiviral compounds.

27. EXAMPLE: POTENTIAL SIV DP178/DP107 ANALOGS:
ANTIVIRAL CHARACTERIZATION

In the Example presented herein, simian
immunodeficiency virus (SIV) DP178-like peptides
identified by utilizing the computer-assisted search
motifs described in the Examples presented in Sections
9, 12 and 19, above, were tested for anti-SIV
activity. It is demonstrated that several of the
identified peptides exhibit potent antiviral
capability.

27.1 MATERIALS AND METHODS 
Anti-SIV antiviral assays: The assays utilized
herein were as reported in Langlois et al. (Langlois,
A.J. et al., 1991, AIDS Research and Human
Retroviruses 7:713-720).

Peptides: The peptides characterized in the
study presented herein were peptides T-391 to T-400,
as shown in FIG. 48. These peptides represent a walk
through the DP178-like region of the SIV TM protein.

Each peptide was tested at 2-fold serial
dilutions ranging from 100μg/ml to approximately
100ng/ml. For each of the assays, a well containing
no peptide was also used.

27.2 RESULTS 
The data summarized in FIG. 48 represent
antiviral information obtained via "peptide walks"
through the DP178-like region of the SIV TM protein.

As shown in FIG. 48, peptides T-391 to T-400 were
tested and exhibited a potent antiviral activity as
crude peptides.

Thus, the computer assisted searches described,
hereinabove, as in, for example, the Example presented
in Section 9, successfully identified
viral peptide domains that represent highly promising
anti-SIV antiviral compounds.

28. EXAMPLE: ANTI-VIRAL ACTIVITY OF DP107 AND
DP178 PEPTIDE TRUNCATIONS AND MUTATIONS

The Example presented in this Section represents
a study of the antiviral activity of DP107 and DP178
truncations and mutations. It is demonstrated that
several of these DP107 and DP178 modified peptides
exhibit substantial antiviral activity.

28.1 MATERIALS AND METHODS 

Anti-HIV assays: The antiviral assays performed
were as those described, above, in Section 6.1.
Assays utilized HIV-1/IIIB and/or HIV-2 NIHZ isolates.
Purified peptides were used, unless otherwise noted in
FIGS. 49A-C.

Peptides: The peptides characterized in the
study presented herein were:

1) FIGS. 49A-C present peptides derived from
the region around and containing the DP178
region of the HIV-1 BRU isolate.
Specifically, this region spanned from gp41
amino acid residue 615 to amino acid residue
717. The peptides listed contain
truncations of this region and/or mutations
which vary from the DP178 amino
acid sequence. Further, certain of the
peptides have had amino- and/or carboxy-
terminal groups either added or removed,
as indicated in the figures; and

2) FIG. 50 presents peptides which represent
truncations of DP107 and/or the gp41 region
surrounding the DP107 amino acid sequence of
HIV-1 BRU isolate. Certain of the peptides
are unblocked or biotinylated, as indicated
in the figure.

Blocked peptides contained an acyl N-terminus and
an amido C-terminus.

28.2 RESULTS
Anti-HIV antiviral data was obtained with the
group 1 DP178-derived peptides listed in FIG. 49A-C.
The results shown for the full-length, non-mutant DP178
peptide (referred to in FIG. 49A-C as T20) are for 4ng/ml.

In FIG. 49A, a number of the DP178 truncations
exhibited a high level of antiviral activity, as
evidenced by their low IC50 values. These include, for
example, test peptides T-50, T-624, T-636 to T-641,
T-645 to T-650, T-652 to T-654 and T-656. T-50
represents a test peptide which contains a point
mutation, as indicated by the residue's shaded
background. The HIV-1-derived test peptides exhibited
a distinct strain-specific antiviral activity, in that
none of the peptides tested on the HIV-2 NIHZ isolate
demonstrated appreciable anti-HIV-2 antiviral
activity.

Among the peptides listed in FIG. 49B are test
peptides representing the amino (T-4) and carboxy
(T-3) terminal halves of DP178. The amino
terminal peptide was not active (IC50 >400μg/ml) whereas
the carboxy terminal peptide showed potent antiviral
activity (IC50 = 3μg/ml). A number of additional test
peptides also exhibited a high level of antiviral
activity. These included, for example, T-61/T-102,
T-217 to T-221, T-235, T-381, T-677, T-377, T-590,
T-378, T-591, T-271 to T-272, T-611, T-222 to T-223 and
T-60/T-224. Certain of the antiviral peptides contain
point mutations and/or amino acid residue additions
which vary from the DP178 amino acid sequence.

In FIG. 49C, point mutations and/or amino- and/or
carboxy-terminal modifications are introduced into the
DP178 amino acid sequence itself. As shown in the
figure, the majority of the test peptides listed
exhibit potent antiviral activity.

Truncations of the DP107 peptide (referred to in
FIG. 50 as T21) were also produced and tested, as shown
in FIG. 50. FIG. 50 also presents data concerning
blocked and unblocked peptides which contain
additional amino acid residues from the gp41 region in
which the DP107 sequence resides. Most of these
peptides showed antiviral activity, as evidenced by
their low IC50 values.
Thus, the results presented in this Section
demonstrate that not only do the full length DP107 and
DP178 peptides exhibit potent antiviral activity, but
truncations and/or mutant versions of these peptides
can also possess substantial antiviral character.






29. EXAMPLE: POTENTIAL EPSTEIN-BARR DP178/DP107
ANALOGS: ANTIVIRAL CHARACTERIZATION

In the Example presented herein, peptides derived
from the Epstein-Barr (EBV) DP178/DP107 analog region
of the Zebra protein identified, above, in the Example
presented in Section 20 are described and tested for
anti-EBV activity. It is demonstrated that among
these peptides are ones which exhibit potential
antiviral activity.



29.1 MATERIALS AND METHODS

Electrophoretic Mobility Shift Assays (EMSA):
Briefly, an EBV Zebra protein was synthesized
utilizing SP6 RNA polymerase in vitro transcription
and wheat germ in vitro translation systems (Promega
Corporation recommendations; Butler, E.T. and
Chamberlain, M.J., 1984, J. Biol. Chem. 257:5772;
Pelham, H.R.B. and Jackson, R.J., 1976, Eur. J.
Biochem. 62:247). The in vitro translated Zebra
protein was then preincubated with increasing amounts
of peptide up to 250 ng/ml prior to the addition of
10,000 to 20,000 c.p.m. of a 32P-labeled Zebra response
element DNA fragment. After a 20 minute incubation in
the presence of the response element, the reaction was
analyzed on a 4% non-denaturing polyacrylamide gel,
followed by autoradiography, utilizing standard
gel-shift procedures. The ability of a test peptide to
prevent Zebra homodimer DNA binding was assayed by the
peptide's ability to abolish the response element gel
migration retardation characteristic of a
protein-bound nucleic acid molecule.

Peptides: The peptides characterized in this
study represent peptide walks through the region
containing, and flanked on both sides by, the
DP178/DP107 analog region identified in the Example
presented in Section 20, above, and shown in
FIG. 33. Specifically, the peptide walks covered the
region from amino acid residue 173 to amino acid
residue 246 of the EBV Zebra protein.
Each of the tested peptides was analyzed at a
range of concentrations, with 150 ng/ml being the
lowest concentration at which any of the peptides
exerted an inhibitory effect.



29.2 RESULTS

The EBV Zebra protein transcription factor
contains a DP178/DP107 analog region, as demonstrated
in the Example presented, above, in Section 20. This
protein appears to be the primary factor responsible
for the reactivation capability of the virus. A
method by which the DNA-binding function of the Zebra
protein may be abolished may, therefore, represent an
effective antiviral technique. In order to identify
potential anti-EBV DP178/DP107 peptides, therefore,
peptides derived from the region identified in Section
20, above, were tested for their ability to inhibit
Zebra protein DNA binding.

The test peptides' ability to inhibit Zebra
protein DNA binding was assayed via the EMSA assays
described, above, in Section 29.1. The data
summarized in FIGS. 51A-B present the results of EMSA
assays of the listed EBV test peptides. These
peptides represent one amino acid "walks" through the
region containing, and flanked on both sides by, the
DP178/DP107 analog region identified in the Example
presented in Section 20, above, and shown in
FIG. 33. As shown in FIGS. 51A-B, the region from
which these peptides are derived lies from EBV Zebra
protein amino acid residue 173 to 246. A number of
the test peptides which were assayed exhibited an
ability to inhibit Zebra protein homodimer DNA
binding, including 439, 441, 444 and 445.

Those peptides which exhibit an ability to
inhibit Zebra protein DNA binding represent potential
anti-EBV antiviral compounds whose ability to inhibit
EBV infection can be further characterized.

30. EXAMPLE: IDENTIFICATION OF RSV DP107/DP178 
ANALOGS WITH REDUCED BINDING 
AFFINITY 



In the example presented herein, peptides derived
from the RSV DP178 analog T112 are described and
tested for binding affinity to the DP107-like domain
of the RSV F1 protein. Particular peptides are
identified that have a reduced binding affinity for
their DP107-like target, and key amino acid residues
are identified that confer high binding affinity to the
native peptide (i.e., to T112). Such peptides are
useful, e.g., in screening assays such as those
described above in Section 5.6.1 to identify compounds
which inhibit or disrupt the interaction between DP107
and DP178, and in providing guidance for generation of
additional peptides exhibiting reduced affinity
binding.

30.1 MATERIALS AND METHODS
A maltose binding fusion protein of the RSV F1
protein (MF5.1) was constructed using methods similar
to those described in Section 8.1.2, supra, for
construction of the M41 fusion protein. Specifically,
the DNA sequence corresponding to amino acid residues
142-302 of the RSV F1 protein was amplified by PCR and
cloned into the Xmn I site of the expression vector
pMal-p2 (New England Biolabs) to give MF5.1. These
amino acid residues correspond to the extracellular
domain of the RSV F1 protein including its DP107
region but excluding the DP178 region.

The peptides characterized in the study presented
herein were: T112, T800, T801, T802, T803, T804,
T805, T806, T807, T808, T809, T810, T811, T1669,
T1670, T1671, T1672, T1673, T1680, T1681, T1682, T1683
and T1684, as shown in FIG. 53. T112 represents the
DP178-like region of the RSV F1 protein. The other
peptides characterized are modified DP178 proteins
derived from T112.

Cell fusion assays were performed with each of
the peptides as described in Section 17 above. The
binding affinity of each peptide was also measured in
a competitive binding assay described in Section 5.6.1
above, wherein the concentration of each peptide
necessary to bind to the MF5.1 fusion protein (i.e.,
the B50 value), and thereby disrupt binding of
biotin-labeled T112 (T888) to the fusion protein, was
measured.
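
By way of illustration only, a B50 value of the kind referred to above might be estimated from a competition curve as sketched below. The sketch assumes that B50 denotes the competitor concentration at which binding of the labeled peptide falls to 50% of its uninhibited level, and it uses log-linear interpolation between the two bracketing data points. The function name `estimate_b50` and the numerical data are hypothetical and are not taken from the assays described herein.

```python
import math

def estimate_b50(concs_nM, fraction_bound):
    """Estimate the 50% displacement point by log-linear interpolation.

    concs_nM       -- competitor concentrations in increasing order
    fraction_bound -- labeled-peptide binding at each concentration,
                      expressed as a fraction of the uninhibited signal
    Returns None when the curve never crosses 50%.
    """
    points = list(zip(concs_nM, fraction_bound))
    for (c_lo, f_lo), (c_hi, f_hi) in zip(points, points[1:]):
        if f_lo >= 0.5 >= f_hi and f_lo != f_hi:
            # Fractional distance of the 50% level between the two points.
            t = (f_lo - 0.5) / (f_lo - f_hi)
            log_c = math.log10(c_lo) + t * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None

# Hypothetical competition data, for illustration only.
concs = [0.1, 1.0, 10.0, 100.0]   # nM of unlabeled test peptide
bound = [0.95, 0.75, 0.25, 0.05]  # fraction of labeled peptide still bound
b50 = estimate_b50(concs, bound)
print(round(b50, 2))  # 3.16
```

A full curve fit (for example, a four-parameter logistic model) would normally be preferred where enough data points are available; the interpolation above is merely the simplest estimate.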

30.2 RESULTS 
T112 is a 35 amino acid residue peptide that
corresponds to amino acid residues 482-516 of the RSV
F1 protein and has the following amino acid sequence:

VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV 



The peptide represents the DP178-like region of the
RSV F1 protein and has substantial antiviral activity
against RSV as discussed in Section 17.2 above and
shown in FIG. 28A.

T112 analogs were generated according to at least
three different strategies to generate peptides based
on T112 that would still bind to the DP107-like domain
of the RSV F1 protein but with a lower binding
affinity. First, a truncated peptide was generated,
reducing the length of the peptide from 35 to 28 amino
acid residues. Specifically, the truncated peptide,
which is referred to herein as T67, had the amino acid
sequence:



DEFDASISQVNEKINQSLAFIRKSDELL



corresponding to amino acid residues 486-513 of the F1
fusion protein. The binding affinity of the peptide
to the DP107-like domain of the F1 protein was determined
according to the methods described in Section 5.6.1
above. The truncated peptide had a binding affinity
(5 nM) that was five times lower than that of the full
length T112 peptide (2 nM).
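
The residue numbering of a truncated peptide can be checked against its parent sequence with a short script. The following is a minimal sketch using the T112 and T67 sequences given above and the stated starting residue (482) of T112 in the RSV F1 protein; the helper name `locate_fragment` is hypothetical.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"  # F1 residues 482-516
T112_START = 482

T67 = "DEFDASISQVNEKINQSLAFIRKSDELL"  # 28-residue truncation of T112

def locate_fragment(parent, parent_start, fragment):
    """Return (first, last) protein residue numbers of a contiguous fragment."""
    offset = parent.find(fragment)
    if offset < 0:
        raise ValueError("fragment is not a contiguous part of the parent")
    first = parent_start + offset
    return first, first + len(fragment) - 1

print(len(T112), len(T67))                     # 35 28
print(locate_fragment(T112, T112_START, T67))  # (486, 513)
```

Under this numbering the truncation removes four residues from the amino terminus and three from the carboxy terminus of T112.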

As part of a second strategy, the peptides
identified as T800 through T811 in FIG. 53 were
synthesized to identify particular amino acids in T112
that contribute to a larger part of that peptide's
binding affinity. As a whole, these alanine
substitutions represent an "alanine-scanning" type
walk across the sequence of T112.

Each of the peptides synthesized had a change of
three consecutive amino acid residues in the T112
sequence to three alanine residues. Each peptide was
tested for its ability to inhibit the binding of the
native peptide (i.e., of T112) in a competitive
binding assay as described in Section 5.6.1 above.
The results are also shown in FIG. 53. In particular,
the peptides T802, T804, T807 and T810 had
significantly reduced affinity for the DP107-like
target, suggesting that the regions containing amino
acid residues 488-490, 494-496, 503-505 and 512-514 of
the RSV F1 protein (amino acid residues 7-9, 13-15,
22-24 and 31-33, respectively, of T112), contribute
significantly to the high binding affinity of T112 for
its DP107-like target in the RSV F1 protein.
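
The triple-alanine walk described above can be sketched in a few lines. The sketch below assumes non-overlapping windows of three residues starting at position 1 of T112, which is consistent with the residue ranges quoted for T802, T804, T807 and T810; the treatment of the final, two-residue window and the function name `alanine_triplet_scan` are assumptions, and the actual sequences are those shown in FIG. 53.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"  # F1 residues 482-516
T112_START = 482

def alanine_triplet_scan(seq, window=3):
    """Replace successive non-overlapping windows of residues with alanine.

    Returns a list of (first_position, last_position, variant_sequence)
    using 1-based positions within seq.
    """
    variants = []
    for start in range(0, len(seq), window):
        end = min(start + window, len(seq))  # last window may be shorter
        mutated = seq[:start] + "A" * (end - start) + seq[end:]
        variants.append((start + 1, end, mutated))
    return variants

for first, last, mutated in alanine_triplet_scan(T112):
    f1_first = T112_START + first - 1
    f1_last = T112_START + last - 1
    print(f"T112 {first}-{last} (F1 {f1_first}-{f1_last}): {mutated}")
```

With a 35-residue parent this produces twelve variants, matching the number of peptides in the series T800 through T811; the third, fifth, eighth and eleventh variants cover T112 positions 7-9, 13-15, 22-24 and 31-33, respectively.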

The peptides T1669-T1673 and T1680 through T1684
were then synthesized, each of which contains a single
alanine substitution at one of the above-listed amino
acid residue positions of T112. The binding affinity
of these peptides for their DP107-like target can also
be determined by means of the same routine screening
assays, thereby identifying individual amino acid
residues which affect binding affinity of T112.
In addition, a novel peptide,
referred to as T786, was generated by modifying
various amino acid residues in the T112 sequence which
were identified, using standard principles of protein
design, as affecting properties such as binding
affinity, solubility and biological stability.
Specifically, the following amino acid residue
substitutions were made: F2 → Y, S21 → A, F24 → Y and
S28 → A, wherein the numerals indicate the
amino acid residue position in T112. The resultant
peptide, which is referred to herein as T786, thus had
the amino acid sequence:



VYPSDEFDASISQVNEKINQALAYIRKADELLHNV 

The binding affinity of this novel peptide for
the DP107 target was found to be 19 nM, i.e.,
approximately ten-fold less than the binding affinity
of T112.
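
The four substitutions can be applied to the T112 sequence programmatically to confirm that they yield the T786 sequence given above. The following is a minimal sketch; the helper name `apply_substitutions` is hypothetical, and the 2 nM value used for T112 is the figure stated earlier in this Section.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"

# (original residue, 1-based position in T112, replacement residue)
T786_SUBSTITUTIONS = [("F", 2, "Y"), ("S", 21, "A"), ("F", 24, "Y"), ("S", 28, "A")]

def apply_substitutions(seq, substitutions):
    """Apply point substitutions, checking the original residue at each site."""
    residues = list(seq)
    for original, position, replacement in substitutions:
        if residues[position - 1] != original:
            raise ValueError(f"expected {original} at position {position}")
        residues[position - 1] = replacement
    return "".join(residues)

t786 = apply_substitutions(T112, T786_SUBSTITUTIONS)
print(t786)  # VYPSDEFDASISQVNEKINQALAYIRKADELLHNV

# Ratio of the reported binding affinities (19 nM for T786, 2 nM for T112).
print(19 / 2)  # 9.5
```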

The data demonstrate that peptides having a
reduced binding affinity for a DP107 target (i.e., for
an HR1 domain) may be readily found by modifying a
DP178 peptide such as T112, e.g., by means of the
routine techniques and assays described herein.
Further, the techniques and assays identify key amino
acid residues which may be used to construct and
identify other reduced affinity peptides.

31. EXAMPLE: IDENTIFICATION OF HIV DP107/DP178
ANALOGS WITH REDUCED BINDING
AFFINITY

In the example presented herein, peptides derived
from DP178, which is also referred to as T20, are
described and tested for binding affinity to the DP107
domain of the HIV gp41. Particular peptides are
identified that have a reduced binding affinity for
their DP107 target, and key amino acid residues are
identified that confer high binding affinity to the
native peptide (i.e., to T20). Such peptides are
useful, e.g., in screening assays such as those
described above in Section 5.6.1 to identify compounds
which inhibit or disrupt the interaction between DP107
and DP178.

Specifically, the peptides identified as T813 and
T868 through T878 in FIG. 53 were synthesized to
identify particular amino acids in T20 (DP178) that
contribute to a greater part of that peptide's binding
affinity. Each of the peptides synthesized had a
change of three consecutive amino acid residues in the
T20 sequence to three alanine residues. The antiviral
activity of each peptide was assayed in cell fusion
assays as described in Section 6.1.3, above. The
binding affinities of the peptides were also measured
in a competitive binding assay described in Section
5.6.1 above, wherein each peptide's ability to disrupt
the binding of either biotin (T83) or fluorescein
(T1342) labeled DP178 (T20) to the M41Δ178 fusion
protein described in Section 8, above, was measured.
The binding affinity of each peptide to the peptide
referred to as T764
(GSTMGARSMTLTVQARQLLSGIVQQNNLLRAIEAQQH) was also measured
using circular dichroism to monitor the amount of
secondary structure (i.e., helicity) adopted by the
peptides. T764 is a peptide which represents the
DP107 target domain of DP178 (T20).

The results are provided in FIG. 54. In
particular, the peptides T813, T878, T874-T876 and
T871 have significantly reduced affinity for the DP107
region, suggesting the regions corresponding to the
substituted amino acid residues in those peptides
contribute significantly to the high binding affinity
of T20. The peptides T1627-T1632, T1650-T1653 and
T1656-T1665 were then synthesized. Each of these
peptides contains a single alanine substitution at one
of the amino acid residues in one of the regions
identified as contributing significantly to the high
binding affinity of T20. Identical assays which
measured the binding affinity of these peptides
identified four essential residues (I646, Q652, Q653 and
N656, with the numerals indicating the
residue position in the HIV-1LAI gp41 amino acid
sequence) in which alanine substitution completely
prevented binding to the DP107 domain, as well as five
residues (L641, I642, L645, E657 and L663, with the
numerals indicating the residue position in the
HIV-1LAI gp41 amino acid sequence) in which
alanine substitution reduced the binding affinity
but did not actually block binding to the DP107
domain.
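
The residue assignments above can be cross-checked against the DP178 (T20) sequence. The sketch below assumes the T20 sequence YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF, spanning gp41 amino acids 638 to 673 as stated in the Introduction, and takes the nine residues named above as I646, Q652, Q653 and N656 (essential) and L641, I642, L645, E657 and L663 (affinity-reducing). The helper name `single_alanine_variant` is hypothetical; the actual test peptides are those shown in the figures.

```python
T20 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # assumed DP178 sequence
T20_START = 638                                # gp41 numbering of first residue

ESSENTIAL = ["I646", "Q652", "Q653", "N656"]         # alanine blocks binding
REDUCING = ["L641", "I642", "L645", "E657", "L663"]  # alanine lowers affinity

def single_alanine_variant(seq, seq_start, label):
    """Return seq with the residue named by label (e.g. 'I646') set to alanine.

    Raises ValueError if the label does not match the sequence, which
    guards against misread residue letters or positions.
    """
    residue, position = label[0], int(label[1:])
    index = position - seq_start
    if not 0 <= index < len(seq) or seq[index] != residue:
        raise ValueError(f"{label} does not match the sequence")
    return seq[:index] + "A" + seq[index + 1:]

for label in ESSENTIAL + REDUCING:
    kind = "essential" if label in ESSENTIAL else "reducing"
    print(label, kind, single_alanine_variant(T20, T20_START, label))
```

All nine labels agree with the assumed sequence, so each call succeeds and prints the corresponding single-alanine variant.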

The data demonstrate that peptides having a
reduced binding affinity for a DP107 target (i.e., for
an HR1 domain) may be readily found by modifying a
DP178 peptide such as T20, e.g., by means of the
routine techniques and assays described herein.
Further, the techniques and assays identify key amino
acid residues which may be used to construct and
identify other reduced affinity peptides.



The present invention is not to be limited in
scope by the specific embodiments described which are
intended as single illustrations of individual aspects
of the invention, and functionally equivalent methods
and components are within the scope of the invention.
Indeed, various modifications of the invention, in
addition to those shown and described herein will
become apparent to those skilled in the art from the
foregoing description and accompanying drawings. Such
modifications are intended to fall within the scope of
the appended claims.

WHAT IS CLAIMED IS:

1. A method for identifying a compound that
inhibits the formation of or disrupts a DP107/DP178
complex comprising:

(a) preparing, both in the presence and in the
absence of a test compound, a reaction
mixture containing a DP107 peptide and a
DP178 peptide under conditions and for a
time sufficient to allow formation of a
DP107/DP178 complex; and

(b) detecting the formation of a DP107/DP178
complex both in the presence and in the
absence of the test compound,

wherein the formation of a DP107/DP178 complex in the
absence, but not in the presence, of the test compound
indicates that the compound inhibits the formation of
or disrupts a DP107/DP178 complex.

2. The method of Claim 1, wherein the DP107
peptide or the DP178 peptide is a modified DP107 or
DP178 peptide.

3. The method of Claim 2 wherein the modified
DP107 or DP178 peptide has a reduced binding affinity.

[Drawing sheet: annotated amino acid sequence panel with region labels "P12LZIPC", "P23LZIPC", "P12&23LZIPC", "FUSION PEPTIDE" and "LVS PREDICTED COILED-COIL"; the sequence text is not reliably recoverable from the extraction.]